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(54) Synchronisation of data communication 

(57) A data communications system for communi- 
cating a data signal formed of successive data ele- 
ments, the system comprising a transmission node; a 
reception node; and a link providing a data connection 
from the transmission node to the reception node; in 
which: 

the transmission node comprises means for trans- 
mitting a synchronisation clocking signal to the re- 
ception node via the link, the synchronisation clock- 
ing signal having synchronising features occurring 
at a frequency lower than a data element rate; 
means for assembling elements of the data signal 
into data frames, each data frame having a plurality 
of successive data elements of the data signal, for 
transmission to the reception node via the link, the 
assembling means being responsive to the syn- 
chronisation clocking signal so as to set a synchro- 
nisation flag associated with a data element having 



a first predetermined temporal relationship with a 
synchronising feature of the synchronisation clock- 
ing signal; and the reception node comprises: 
means for detecting a synchronising feature of the 
synchronisation clocking signal received from the 
transmission node; means for disassembling re- 
ceived data frames to regenerate the data signal, 
the disassembling means being operable to detect 
a data element associated with a set synchronisa- 
tion flag; means for outputting a data element asso- 
ciated with a set synchronisation flag at a second 
predetermined temporal relationship with respect to 
the synchronising feature of the received synchro- 
nisation clocking signal; the first and second prede- 
termined temporal relationships being arranged so 
that a predetermined system latency exists be- 
tween input of a data element to the transmission 
node and subsequent output of that data element 
by the reception node. 
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Description 

[0001] This invention relates to data communication. 

[0002] An example of a problem in data communication will be described in the context of communicating so-called 
s Direct Stream Digital audio data. However, the present invention is applicable to other types of clocked data, such as 
multi-bit audio data or video data. 

[0003] Direct Stream Digital (DSD) is a high-resolution single-bit audio coding system used for the so-called Super 
Audio CD consumer disc format. DSD was developed with a view to producing audio signals comparable to those 
reproduced from the best analogue formats. DSD signals can produce a frequency response from DC to 1 00kHz and 

10 have a dynamic range of greater than 120 dB across the audio band. 

[0004] DSD makes use of 1 -bit digital audio. 1 -bit oversampling converters exploit a law of information theory whereby 
sample width can be traded off against sampling rate to effect conversion at a given resolution. For example a 1-bit 
converter that oversamples at 1 6 times the stored sample rate can give results which are equivalent to those obtainable 
with a 1 6 bit converter with no oversampling. 1 -bit oversampling converters (also known as Sigma-Delta, noise shaping 

15 or bit stream converters) measure the difference between successive audio samples rather than representing the actual 
value of the waveform amplitude. In DSD a significant improvement in reproduced sound quality is achieved by re- 
cording a high frequency (64F S ) 1-bit signal directly onto a super-audio CD rather than recording a 16-bit signal at 
frequency F s onto a CD using pulse code modulation. 

[0005] DSD systems require a high frequency audio sample clock at 64Fs = 2.8224MHz whereas the sample clock 
20 of standard PCM systems (Fs) is 44.1kHz. This high frequency sample clock is transmitted along with the data to 
facilitate accurate signal reconstruction at the receiving end. Furthermore each channel of 64Fs DSD audio requires 
a transmission bandwidth of 2.8224 Mbit/s. It is a problem to provide interconnections between large-scale multi-track 
production equipmentfor DSD audio such as multi-channel ADC/DACs, DSD mixers and multi-channel DSD recorders 
both because of the high audio bandwidth required for the audio data interconnection and because of the difficulty of 
25 transmitting the high frequency (64Fs ) audio sample clock between devices without compromising the integrity of the 
signal e.g. due to electromagnetic interference from the audio data signal. 

[0006] Several known audio networking systems make use of Ethernet to transmit high bandwidth audio-data be- 
tween a network of audio processing devices. For example the "Magic" system proprietary to Gibson makes use of 
the Ethernet Media Access Control MAC layer (i.e. physical layer and data link layer) to transmit audio data at a fixed 

30 audio sampling frequency of 48kHz using one Ethernet frame per sample period. The CobraNet audio networking 
system proprietary to Peak Audio also uses the Ethernet MAC layer to transmit uncompressed digital audio data be- 
tween networked devices. The CobraNet system uses a 48kHz sampling rate and allows for transmission of 20-bit and 
24-bit audio data. However, none of these known systems provides an interconnection suitable for linking DSD audio 
devices. This is because Ethernet frame timing is completely unsuitable for transmitting a 2.8224MHz DSD sample 

35 clock. 

[0007] This invention provides a data communications system for communicating a data signal formed of successive 
data elements, the system comprising a transmission node; a reception node; and a link providing a data connection 
from the transmission node to the reception node; 
the transmission node comprising: 

40 

means for transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation 
clocking signal having synchronising features occurring at a frequency lower than a data element rate; and 
means for assembling elements of the data signal into data frames, each data frame having a plurality of successive 
data elements of the data signal, for transmission to the reception node via the link, the assembling means being 
45 responsive to the synchronisation clocking signal so as to set a synchronisation flag associated with a data element 

having a first predetermined temporal relationship with a synchronising feature of the synchronisation clocking 
signal; 

and the reception node comprising: 

50 means for detecting a synchronising feature of the synchronisation clocking signal received from the trans- 

mission node; 

means for disassembling received data frames to regenerate the data signal, the disassembling means being 
operable to detect a data element associated with a set synchronisation flag; 

means for outputting a data element associated with a set synchronisation flag at a second predetermined 
55 temporal relationship with respect to the synchronising feature of the received synchronisation clocking signal; 

the first and second predetermined temporal relationships being arranged so that a predetermined system 
latency exists between input of a data element to the transmission node and subsequent output of that data 
element by the reception node. 
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[0008] The present invention use the physical layer of a link (e.g. an Ethernet link) to provide a data communication 
system for transmission of clocked digital data such as DSD data. Note that the present invention is also applicable to 
bit stream audio data other than DSD data such as one-bit, delta-sigma modulated audio data at a variety of sample 
rates or floating point data formats. The advantages of using the physical layer of Ethernet for such data transmission 
are that it offers a large bandwidth, has proven electromagnetic compatibility and has error detection functionality (cyclic 
redundancy checks) already in place. Use of the physical layer makes the logic easy to design and implement. There 
is no need to be concerned with hardware addressing and implementation of windowing protocols as would likely be 
required if the audio data were encoded using higher layer (e.g. MAC layer) technology. Furthermore at the physical 
layer level, Ethernet data transmission is robust and spectrum controlled so that electromagnetic emissions are low. 
The invention provides an elegantly simple and convenient way of synchronising the two ends of such a link which is 
applicable to the transmission of one-bit or multi-bit data. 

[0009] The invention also provides a transmission node for use in a data communications system for communicating 
a data signal formed of successive data elements having a reception node; and a link providing a data connection from 
the transmission node to the reception node; 
the transmission node comprising: 

means for transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation 
clocking signal having synchronising features occurring at a frequency lower than a data element rate; and 
means for assembling elements of the data signal into data frames, each data frame having a plurality of successive 
data elements of the data signal, for transmission to the reception node via the link, the assembling means being 
responsive to the synchronisation clocking signal so as to set a synchronisation flag associated with a data element 
having a first predetermined temporal relationship with a synchronising feature of the synchronisation clocking 
signal. 

[0010] The invention also provides a reception node for use in a data communications system for communicating a 
data signal formed of successive data elements having a transmission node and a link providing a data connection 
from the transmission node to the reception node; 
the reception node comprising: 

means for detecting a synchronising feature of the synchronisation clocking signal received from the transmission 
node, the synchronisation clocking signal having synchronising features occurring at a frequency lower than a 
data element rate; 

means for disassembling received data frames to regenerate the data signal, the disassembling means being 
operable to detect a data element associated with a set synchronisation flag; 

means for outputting a data element associated with a set synchronisation flag at a second predetermined temporal 
relationship with respect to the synchronising feature of the received synchronisation clocking signal. 

[0011] The invention also provides a data communications method for communicating a data signal formed of suc- 
cessive data elements, the system comprising a transmission node; a reception node; and a link providing a data 
connection from the transmission node to the reception node; the method comprising the steps of: 

the transmission node: 

i. transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation clocking 
signal having synchronising features occurring at a frequency lower than a data element rate; 

ii. assembling elements of the data signal into data frames, each data frame having a plurality of successive 
data elements of the data signal, for transmission to the reception node via the link, the assembling step being 
responsive to the synchronisation clocking signal so as to set a synchronisation flag associated with a data 
element having a first predetermined temporal relationship with a synchronising feature of the synchronisation 
clocking signal; 

and the reception node: 

i. detecting a synchronising feature of the synchronisation clocking signal received from the transmission node; 

ii. disassembling received data frames to regenerate the data signal, the disassembling step being operable 
to detect a data element associated with a set synchronisation flag; and 

iii. outputting a data element associated with a set synchronisation flag at a second predetermined temporal 
relationship with respect to the synchronising feature of the received synchronisation clocking signal; 
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the first and second predetermined temporal relationships being arranged so that a predetermined system latency 
exists between input of a data element to the transmission node and subsequent output of that data element by the 
reception node. 

[0012] The invention also provides a method of operation of a transmission node for use in a data communications 
5 system for communicating a data signal formed of successive data elements having a reception node; and a link 
providing a data connection from the transmission node to the reception node; 
the method comprising the steps of: 

transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation clocking signal 
10 having synchronising features occurring at a frequency lower than a data element rate; and 

assembling elements of the data signal into data frames, each data frame having a plurality of successive data 
elements of the data signal, for transmission to the reception node via the link, the assembling step being responsive 
to the synchronisation clocking signal so as to set a synchronisation flag associated with a data element having a 
first predetermined temporal relationship with a synchronising feature of the synchronisation clocking signal. 

15 

[0013] The invention also provides a method of operation of a reception node for use in a data communications 
system for communicating a data signal formed of successive data elements having a transmission node and a link 
providing a data connection from the transmission node to the reception node; 
the method comprising the steps of: 

20 

detecting a synchronising feature.of the synchronisation clocking signal received from the transmission node, the 
synchronisation clocking signal having synchronising features occurring at a frequency lower than a data element 
rate; 

disassembling received data frames to regenerate the data signal, the disassembling step being operable to detect 
25 a data element associated with a set synchronisation flag; and 

outputting a data element associated with a set synchronisation flag at a second predetermined temporal relation- 
ship with respect to the synchronising feature of the received synchronisation clocking signal. 

[0014] Various other respective aspects and features of the invention are defined in the appended claims. Features 
30 from the dependent claims may be combined with features of the independent claims as appropriate and not merely 
as explicitly set out in the claims. 

[001 5] Embodiments of the invention will now be described with reference to the accompanying drawings, throughout 
which like parts are referred to by like references, and in which: 

35 Figure 1 shows the standard seven-layer Open Systems Interconnection (OSI) model for network protocol archi- 

tectures and sub-layers of the Ethernet physical layer; 
Figure 2 illustrates a known system for signal transfer in DSD systems; 

Figure 3 schematically illustrates a DSD interconnection according to an embodiment of the present invention; 
Figure 4 illustrates a star-configuration interconnection that can be formed between several individual items of 
40 DSD equipment; 

Figure 5 schematically illustrates an audio data transmission system according to an embodiment of the present 
invention; 

Figure 6 schematically illustrates how the 64F S audio sample clock signal is transmitted in parallel with the DSD 
audio data along different signal pairs of the category 5 cable; 
45 Figure 7 schematically illustrates reception of the high frequency audio sample clock in parallel with reception of 

the DSD audio data signal; 

Figure 8 schematically illustrates the signal path of the 64Fs DSD sample clock signal; 

Figure 9 depicts an embodiment of the invention in which the synchronisation of the physical layer device is adjusted 
such that it is an exact multiple of the audio sample clock frequency; 
so Figure 10 schematically illustrates a point-to-point audio device link in which one device acts as a clock master 

whilst the other device acts as a clock slave; 

Figure 11 is a flow chart which illustrates the sequence of events followed to establish a synchronised link between 
the master device and the slave device of Figure 8; 

Figure 12 schematically illustrates an apparatus in which multiple parallel links are used between two pieces of 
55 audio equipment in order to achieve a higher channel count than that achievable via a single point-to-point link; 

Figure 13 is a flow chart illustrating how the local clock signals F S (A) and F S (B) are employed to ensure that the 
outputs of two receivers are kept synchronous; 

Figure 1 4 schematically illustrates how audio data buffering is performed in the transmitter; 
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Figure 15 schematically illustrates how audio data buffering is performed at the receiver; 

Figure 16 schematically illustrates the data structure corresponding to a standard Ethernet frame; 

Figure 17 shows the structure of an audio data frame according to an embodiment of the present invention; 

Figure 18A shows the audio data frame format arranged as 384M-byte data words; 

Figure 18B schematically illustrates a 24 DSD channel frame format in which each frame comprises 368 data 
words including 352 DSD samples for 24 channels plus 88 bytes of auxiliary data; 
Figure 19 shows the control data format arranged as 26*4-byte data words; 

Figure 20 schematically illustrates the structure of each of the three 1 6-bit frame format field sections corresponding 
to the frame format of Figure 18B; 

Figure 21 schematically illustrates the three 4-nibble sections of the frame format ID containing a set of data entries 
to be processed at the receiver; 

Figure 22 schematically illustrates the format of the 32-bit data block corresponding to the 24 DSD channel frame 
format of Figure 18B; 

Figure 23A schematically illustrates how six parity bits P0 to P5 are generated from 24 audio data bits and the two 
auxiliary data bits; 

Figure 23B schematically illustrates how a syndrome is calculated by performing XNOR operations on the received 
data elements; 

Figure 24 is a table showing a the composition of a stream of nibbles from the interleaver for the 24 DSD channel 
frame format of Figure 1 8B; 

Figure 25 schematically illustrates the protocol layers of the MAC-DSD protocol for the particular example embod- 
iment using the 24 DSD channel frame format- 
Figure 26A schematically illustrates the AES3 sub-frame format; 

Figure 26B schematically illustrates the sub-frame format for PCM transmission according to the present technique; 
Figures 27A to D schematically illustrate how three different indications S, Z and V are 

multiplexed using the M-bit of Figure 26B; 
Figures 28A to E schematically illustrates circumstances in which the S-bit takes precedence over the Z-bit in the 
M-bit of the sub-frame format according to Figure 26B; 

Figure 29 is a table 1 0 defining a frame type value index for a each of a number of different frame formats including 
frame types having different numbers of PCM samples per frame; 

Figure 30 is a table specifying the information derivable from the flag bits of the frame format of Figure 18B; 
Figure 31 specifies how values for the two flag bits associated with the base clock are interpreted; 
Figure 32 schematically illustrates how a multiplexed clock signal is formed in dependence upon a 64Fs signal 
and a word clock signal; 

Figure 33 schematically illustrates five consecutive DSD samples and their timing relationship with the local 64Fs 
clock and the word clock; 

Figure 34 schematically illustrates a MAC DSD transmitter adapted for transmission of both PCM and DSD data; 
Figure 35 schematically illustrates a MAC DSD receiver adapted for reception of both PCM and DSD data; 
Figure 36 schematically illustrates a system in which two sample-synchronous links are operated in parallel and 
in which the Fs/n sync is used to synchronise the parallel links; 

Figure 37 schematically illustrates a measured difference in propagation delay between the two parallel links of 
Figure 27; 

Figure 38 schematically illustrates a state diagram representing how switching is controlled to achieve a correct 
polarisation for the crossover circuit; 

Figure 39 schematically illustrates a Linear Feedback Shift Register (LFSR) used to create a pseudo-random 
sequence; 

Figure 40 schematically illustrates an arrangement for audio clock transmission according to a development of the 
present technique; 

Figure 41 schematically illustrates an arrangement for audio clock reception according to a development of the 
present technique; 

Figure 42 schematically illustrates a synchronisation signal physical arrangement according to the development 
of the present technique; 

Figure 43 schematically illustrates an audio data frame format according to the further development of the present 
technique; 

Figures 44A to F schematically illustrate frame format identifier contents of the audio frame format of Figure 43; 
Figure 45 schematically illustrates the frame payload formatting process according to the present technique; 
Figures 46 A and B schematically illustrate two different sync sequences of audio stream synchronisation markers; 
Figure 47 schematically illustrates a logical channel frame segmentation process; 

Figure 48 schematically illustrates a 26-bit logical channel slice comprising 24 bits of audio data and 2 bits of 
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auxiliary data; 

Figure 49 schematically illustrates how the auxiliary data bits (two per LC slice) are multiplexed over two logical 
channels; 

Figure 50A schematically illustrates a convolutional scrambler and Figure SOB schematically illustrates a corre- 
sponding convolutional descrambler; 

Figure 51 schematically illustrates an auxiliary data protocol hierarchy; 

Figure 52 schematically illustrates synchronisation connections between two transceivers; 

Figure 53 schematically illustrates a hardware arrangement used to achieve audio latency control using the audio 

stream synchronisation markers (ASSM) according to the development of the present technique. 



[0016] As described above, some known audio networking systems use the data link layer of Ethernet for transmis- 
sion of uncompressed digital audio data at standard sampling frequencies of around 48kHz. By way of contrast, em- 
bodiments of the present invention use the physical layer of Fast Ethernet to provide a point to point connection for 
transmission of high frequency (2. 8224MHz) digital audio data. The advantages of using the physical layer of Fast 

'5 Ethernet for audio data transmission are that it offers a large bandwidth, has proven electromagnetic compatibility and 
has error detection functionality (cyclic redundancy checks) already in place. Use of the physical layer makes the logic 
easy to design and implement. There is no need to be concerned with hardware addressing and implementation of 
windowing protocols as would likely be required if the audio data were encoded using higher layer (e.g. MAC layer) 
technology. Furthermore at the physical layer level, Ethernet data transmission is robust and spectrum controlled so 

20 that electromagnetic emissions are low. 

[0017] In order to explain the principles by which the present embodiments operate, the layered structure of network 
protocol architectures and the lower layers of the Ethernet architecture will be described in detail below. 
[0018] Figure 1 shows the standard seven-layer Open Systems Interconnection (OSI) model for network protocol 
architectures. The model comprises an application layer 270, a presentation layer 260, a session layer 250, a transport 

25 layer 240, a network layer 230, a data link layer 220, and a physical layer 21 0. 

[0019] The application layer 270 provides a user interface, usually in the form of an application program, to a range 
of distributed information services on the network. The services provided by this layer include file transfer, access and 
management, as well as general document and message interchange services such as electronic mail. 
[0020] The presentation layer 260 is concerned with the representation of data during transfer between two commu- 

30 nicating application processes. It selects an appropriate transfer syntax to be used during a transaction, so that the 
structure of the messages being exchanged between two application entities is maintained. The presentation layer 
260 also manages data encryption and data compression. 

[0021 ] The session layer 250 establishes sessions between communicating applications on communicating network 
nodes. It may optionally provide interaction management during two-way alternate i.e. half-duplex (rather than two- 
35 way simultaneous i.e. full-duplex) data exchange. Further optional features provided by this layer are synchronisation 
for lengthy network transactions and exception reporting. 

[0022] The transport layer 240 acts as an interface between the higher application-oriented layers (session 250, 
presentation 260 and application 270 layers) and the underlying network-dependent protocol layers 210, 220, 230. 
The transport layer provides the session layer with a defined set of message transfer facilities. It offers a number of 
40 classes of services appropriate to different types of network, ranging from class 0 which provides basic connection 
establishment to class 4 which provides full error control and flow control. 

[0023] The lowest three layers (network 230, data link 220 and physical layers 21 0) of the OSI model are all network 
dependent. The network layer 230 is responsible for establishing and clearing a connection between two transport 
layer protocol entities and it supports network routing and addressing. The data link layer 220 provides the network 

45 layer with a reliable information transfer facility and is responsible for such functions as error detection and message 
retransmission. Typically both a connectionless and a connection-oriented service is provided. The connectionless 
service simply discards received frames in which an error is detected whereas a connection-oriented service aims to 
provide an error-free information transfer facility. Finally, the physical layer 21 0 provides the data link layer 220 with a 
means of transmitting a serial bit stream between two pieces of equipment. It converts the data into the stream of 

50 electric or analogue pulses that will actually cross the transmission medium and it oversees the transmission of data. 
[0024] Ethernet is a local area network (LAN) technology, which uses a simple or branching bus-like connection line. 
The transmission medium in an Ethernet network is formed from one or more continuous lines of cable linked by hubs. 
Network devices are connected to the cable and they compete for network access using a Carrier Sensing Multiple 
Access with Collision Detection (CSMA/CD) protocol. According to the CSMA/CD protocol, all client devices monitor 

55 the transmission medium and wait until the transmission line is available before transmitting any messages. If two 
network nodes try to transmit messages at the same time, a collision occurs. The client devices then stop, wait for a 
random time interval and attempt to transmit again. 

[0025] Standard Ethernet systems known as 1 0BASE-T systems provide transmission speeds up to 10 Mega bits 
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per second (Mbps) whereas so-called "Fast Ethernet" (or 1 0OBASE-T) systems provide transmission speeds of up to 
1 00 Mbps. Further higher performance systems are available such as so-called "Gigabit Ethernet'. Fast Ethernet uses 
the same wiring systems, Media Access Control (MAC) method and frame methods as 10BASE-T Ethernet. The em- 
bodiments may use any of these systems. 

5 [0026] Ethernet systems may use twisted pair cabling or an optical fibre connection. Twisted pair is standard copper 
wire that is typically used to connect computers to a telephone link. To reduce cross-talk or electromagnetic induction 
between pairs of wires, two or more insulated wires are twisted around each other. The twisting reduces the effective 
radiating area of the cable because electro-magnetic effects of alternate twists tend to cancel at distances greater than 
the twist pitch. Each connection on twisted pair requires two wires. If the twisted pair is enclosed by a shield that 

10 functions as a ground it is known as shielded twisted pair (STP). Standard twisted pair cabling is known as unshielded 
twisted pair (UTP). 

[0027] In Fast Ethernet systems the segment length for twisted pair cable segments is set to a maximum of 100m 
to ensure that signal round-trip timing specifications are met. The problem with Fast Ethernet is how to achieve a data 
transfer rate of 100Mbit/s over unshielded twisted-pair cable (UTP). In practice there are two standards that can be 

15 used to achieve this, one of which (1 00BASE-4T) uses voice-grade category 3 cable and another (1 00BASE-X) which 
uses either high-quality category 5 UTP cable, shielded twisted-pair cable (100BASE-TX) or optical fibre 
(100BASE-FX). In the 100BASE-X system each type of transmission medium requires a different Physical Medium 
Dependent (PMD) sublayer. Category 5 UTP comprises 4 signal pairs, two pairs of which are typically utilised for 
Ethernet i.e. one signal pair forelock transmit and receive and one signal pair for data transmit and receive. This leaves 

20 two unused signal pairs. 

[0028] The sub-layers of the Ethernet physical layer and data link layer are shown alongside the seven layer OSI 
model. 

[0029] The data link layer 220 comprises the Media Access Control (MAC) layer 224 and the Logical Link Control 
(LLC) layer 222. The physical layer comprises a reconciliation sub-layer 21 9, a Media Independent Interface (Mil) 218, 
25 a physical coding sub-layer 216, a physical medium attachment sub-layer 214, a physical medium dependent sub- 
layer 212 and a Medium Dependent Interface (MDI) 211. 

[0030] The MAC sub-layer 224 performs the two main functions of data encapsulation and media access manage- 
ment. The data encapsulation functionality includes data framing, handling of source and destination addresses and 
detection of physical medium transmission errors. The medium access management functionality includes medium 

30 allocation (collision avoidance) and contention resolution (collision handling). 

[0031] The MAC sub-layer 224 can operate either in half-duplex mode or in full duplex mode. In half-duplex mode, 
network nodes contend for use of the physical medium using multiple access (CSMA/CD) algorithms. The full duplex 
mode allows for simultaneous transmission and reception without interference. For the full duplex mode to be used 
three conditions must first be satisfied. Firstly, the physical medium must be capable of supporting simultaneous trans- 

35 mission and reception without interference. Secondly there must be exactly two nodes on the local area network so 
that the physical medium is treated as a full duplex point-to-point link between the nodes. The use of CSMA/CD algo- 
rithms is unnecessary in this full duplex case because there is no contention for use of a shared medium. The third 
condition is that both network nodes must be configured to use full duplex operation. 

[0032] The Logical Link Control (LLC) layer 222 performs error-checking functions on data frames and manages 
40 links between communicating network nodes. 

[0033] The Reconciliation 219 sublayer maps the signal set provided at the Media Independent Interface 218 to the 
Physical Coding Sublayer 21 6. 

[0034] The Physical Coding Sub-layer (PCS) 21 6 provides a uniform interface to the Reconciliation sub-layer for all 
100BASE-TX physical layer entity (PHY) implementations. The PCS 216 provides all services required by the Mil 

« including: encoding of Mil 4-bit "data nibbles" to 5-bit code groups (and also decoding from 5-bit to data nibbles); 
generation of carrier sense and collision detect indications; serialisation of code-groups for transmission on the under- 
lying PMA sub-layer 21 4 (and de-serialisation of code groups on reception from the PMA214); and mapping of transmit, 
receive, carrier sense and collision detection between the Mil 218 and the underlying PMA 214. 
[0035] The Physical Medium Attachment (PMA) sub-layer 214 provides a medium-independent means for the PCS 

so to support the use of a range of physical media. The 1 00BASE-TX PMA performs the functions of: mapping of transmit 
and receive code-bits between the underlying Physical Medium Dependent (PMD) sub-layer 212 and the PCS 216; 
and generating a control signal indicating the availability of the PMD 212 to a PCS 216. The PMA sub-layer 21 4 may 
optionally: generate indications of carrier errors from the underlying PMDsub-layer212; sense receive channel failures; 
and transmit far-end fault indications. 

55 [0036] The PMD sub-layer 212 is effectively a set of signalling standards that define 125Mbit/s full duplex signalling 
systems, which accommodate multi-mode optical fibre (F), shielded twisted pair (STP) and unshielded twisted pair 
(UTP) wiring. 

[0037] The purpose of the Media Independent Interface (Mil) 218 is to provide a simple interconnection between the 
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MAC sub-layers 222, 224 and the physical layer entities (PHYs) for data transfer at 10Mbit/s and 100Mbit/s. The 
functionality is identical at both data rates, as are the signal timing relationships. The only difference between 10Mbit/ 
s and 100Mbit/s operation is the nominal clock frequency. The Mil 218 is used to provide media independence for 
various forms of unshielded twisted-pair wiring, shielded twisted-pair wiring, fibre optic cabling and potentially other 

s media, so that identical MACs may be used with any of these media. The Mil 218 maximises media independence by 
cleanly separating the Data Link Layer 220 and the Physical Layer 210 of the OS! seven-layer reference model. The 
data and delimiters of the Mil 218 are synchronous to clock references and the Mil uses Low Voltage Transistor- 
Transistor Logic (LVTTL) signal levels compatible with common integrated circuit processes. The Mil 218 provides 
independent 4-bit wide data-transmit and data-receive paths and full duplex operation. Each direction of data transfer 

10 is serviced with 7 signals: a 4-bit data bundle, a 1-bit delimiter signal, a 1-bit error signal and a 1-bit clock signal. 

[0038] Figure 2 illustrates a known system for signal transfer in Direct Stream Digital systems. The apparatus 300 
comprises an analogue-to-digital/ digital-to-analogue (ADC/DAC) converter 310 connected to a DSD multi-channel 
recorder 320. The connection comprises two separate cables: a first cable 315 is an optical fibre carrying 8 channels 
(about 22.6Mbit/s) of DSD audio data and a second cable 325 carries the high frequency sample clock. It is standard 

*5 studio practice to use separate cables for the audio data and the sample clock 

[0039] Figure 3 schematically illustrates a DSD interconnection according to an embodiment of the present invention. 
In this arrangement 400, a single cable 405 is used to connect a multi-channel ACD/DAC 410 to a DSD multi-channel 
recorder 420. The cable 405 is a category 5 unshielded twisted pair cable. This cable has four signal pairs, two pairs 
of which are used to transmit and receive audio data, encoded using Ethernet physical layer technology and the re- 

20 maining two pairs of which are used to convey a DSD sample clock in both directions across the link (see Table 1 
below). The clock signal and the audio data signal are conditioned to decrease the likelihood of interference between 
the two signals degrading the quality of the clock signal. The clock signal is used to synchronise a phase locked loop 
(PLL) in the receiving device, which in turn may be used as a sample clock for ADCs and DACs. Any jitter on the 
sample clock is undesirable since it will manifest itself as distortion on the reproduced analogue audio output. The 

25 audio signal is intrinsically digital and consequently more robust to degradation than the clock signal. A packet data 
transmission system such as Ethernet is capable of carrying the DSD audio data. In this particular embodiment, the 
physical layer of Fast Ethernet (1 00BASE-TX) is used to provide a channel bit-rate of 1 0OMbit/s which accommodates 
audio data from 32 DSD channels on a single link. In an alternative embodiment the 1 0OMbit/s link is used to support 
24 DSD channels on a single link. 

30 [0040] Ethernet is an asynchronous data link and is thus inherently unsuitable for transmission of the high-integrity, 
64F S audio clock signal. For this reason the audio sample clock is transmitted on separate signal pairs of the category 
5 UTP cable. 

[0041] The single cable connection in Figure 3 is fundamentally a point to point link directly connecting the two audio 
devices. It uses a special "crossover" category 5 cable that is wired to reverse the input/output connections. In this 
35 case a custom made crossover cable is required because conventional crossover cables such as those used for office 
networking do not reverse the two spare signal pair connections used in this embodiment for transmission of the audio 
sample clock. 

[0042] In alternative embodiments of the invention, such as that illustrated in Figure 4, more complex interconnections 
can be formed between several individual items of DSD equipment. The apparatus illustrated in Figure 4 comprises a 

40 star-configuration DSD router 430, a multi-channel ADC/DAC 440, a DSD mixer 450 and a DSD multi-channel recorder 
460. Three point-to-point links 445, 455 and 465 are connected together via the central DSD router 430. Unlike the 
connection of Figure 3, standard category 5 cable can be used for each of the three connections in this star configu- 
ration. This is because the port connections on the router are internally reversed such that signal outputs of one device 
connect to signal inputs of another device. 

45 [0043] The router 430 comprises a number of signal transceivers, each transceiver comprising a data clock trans- 
mitter (described below with reference to Figure 6) and a data and clock receiver (described below with reference to 
Figure 7). Switching and routing functions are carried out by a crosspoint switch (not shown) acting on the recovered 
clock and streamed audio data. In other words, signals are not transferred across the router in packetised form. 
[0044] The cable 405 linking the transmitter device to the receiver device in Figure 3 is terminated with 8-terminal 

so RJ45 plugs and both transmitter and receiver devices are fitted with RJ45 sockets. The table below specifies the setting 
of the RJ45 socket terminal connections for the audio devices of Figure 3 and for the star-configuration router devices 
of Figure 4. 



Table 1 



Pin number 


Function (audio device) 


Function (star-configuration router) 


1 


Data transmit + 


Data receive + 
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Table 1 (continued) 



w 



Pin number 


Function (audio device) 


Function (star-configuration router) 


2 


Data transmit - 


Data receive - 


3 


Data receive - 


Data transmit - 


4 


Clock transmit + 


Clock receive + 


5 


Clock transmit - 


Clock receive - 


6 


Data receive + 


Data transmit + 


7 


Clock receive - 


Clock transmit - 


8 


Clock receive + 


Clock transmit + 



15 [0045] Figure 5 schematically illustrates an audio data transmission system according to an embodiment of the 
present invention. The apparatus 500 comprises a first audio processing device 510 and a second audio processing 
device 520 linked by a category 5 unshielded twisted pair cable 515. Each audio processing device comprises a Field 
Programmable Gate Array (FPGA) 512, a physical layer interface (PHY) 514, a transformer 516 and an RJ45 8-pin 
connector 51 8. The FPGA 512 provides a Multichannel Audio Connection for DSD (MAC-DSD). 

20 [0046] 1 -bit 64Fs direct stream digital data is supplied from the audio device to the FPGA 51 2. During a transmission 
~~ operation the FPGA 512 performs audio data buffering~and framing operations whereas during data reception the 
FPGA extracts data from the framed structure and converts it back to a DSD stream. The FPGA performs transmission 
and reception concurrently, implementing afull-duplex audio connection. The format of the data frames will be described 
in detail below with reference to Figures 1 5 and 1 6. The PHY device 514 performs physical layer coding of the framed 

25 audio data, implements spectrum control processing and has line drivers that amplify the current and hence the power 
of the signal to increase its robustness during transmission. The PHY device 514 effectively implements the Physical 
Coding Sublayer (PCS), Physical Medium Attachment (PMA) and Physical Medium Dependent (PMD) sub-layers of 
the physical layer 210. In this embodiment the PHY device 514 is an Intel™ LXT972a component and it operates in 
full duplex mode with no auto-negotiation and with data scrambling on. The transformer 51 6 outputs the data for trans- 

30 mission on the category 5 cable 515. On reception the transformer 516 receives the signal prior to physical layer 
processing. The interface between the FPGA 512 and the PHY device 514 is a Media Independent Interface (Mil). 
Thus the FPGA replaces the network address handling Media Access Controller (MAC) of the conventional Ethernet 
system. Multiple sample rates are supported and the system is able to accommodate potential developments towards 
higher DSD sample rates. Any change to the audio sample rate affects the way audio data streams are packed into 

35 data frames and this functionality is determined by circuitry in the FGPA 51 2. Provided that the physical layer link has 
sufficient bandwidth changes in the audio sample rate have no effect on the PHY device 514. 
[0047] Figure 6 schematically illustrates how the 64F S audio sample clock signal is transmitted in parallel with the 
DSD audio data along different signal pairs of the category 5 cable. As in Figure 5, the FPGA 512, the PHY device 
514 and the transformer 51 6 perform the audio data signal processing prior to its transmission on two signal pairs of 

40 the Category 5 UTP cable 515. The 64F S audio sample clock is supplied as input both to the FPGA, which performs 
framing and buffering, and to a low pass filter 552. The low-pass filter serves to reduce electro-magnetic emissions 
during transmission of the clock signal. The output of the low-pass filter 552 is supplied as input to a differential line 
driver 554 and is subsequently fed through a 10BASE-Ttype Ethernet transformer 556. The clock signal is fed via the 
RJ45 connector 518 onto a signal pair on the category 5 UTP cable 515 where it is transmitted in parallel with the 

45 audio data. Transmission of the audio sample clock signal is important since it enables the FPGA of the receiving 
device to resynchronise the received audio data and thus to reconstitute the DSD bitstreams. The category 5 UTP 
cable used in this embodiment of the invention has a characteristic impedance of 100 Ohms. Alternative embodiments 
may use screened twisted pair cable which gives enhanced electromagnetic compatibility (EMC) performance. Further 
alternative cable types that may be used include category 5e cable (for data rates of up to 250Mbit/s), category 6 cable 

so (suitable for Gigabit Ethernet or category 7 cable which allows even higher data transmission rates. 

[0048] The FPGA is only one solution to achieve the functionality required at the transmitter and receiver. Software- 
controlled general purpose microprocessors may of course be used, in which case the software could be provided by 
a storage medium (e.g. a read-only memory, flash memory, magnetic disk or optical disk) or a transmission medium 
(e.g. a network or the internet) 

55 [0049] Figure 7 schematically illustrates reception of the high frequency audio sample clock in parallel with reception 
of the DSD audio data signal. The parallel signals are received from the cable 515 at the RJ45 connector 522 of the 
receiving device. The DSD audio signal is received by a transformer 524 and is then supplied to a physical layer 
interface 526 followed by an FPGA 528 which unframes the data and produces a DSD bit stream. The DSD audio 
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stream is output from the FGPA according to a 64Fs clock signal 529 derived from the local phase locked loop of the 
receiving device. 

[0050] The received audio clock signal is supplied to a transformer 562 on arrival at the receiving device. The output 
of the transformer is supplied to a high pass filter 563 and then to a low pass filter 564, which is of the same type as 
the low pass filter 552 in the transmitting device. The low pass filter 564 in the receiver serves to remove any high 
frequency interference in the received signal, derived either from the audio data signal, which it travelled adjacent to 
along the cable 515, or from external sources. The output from the low-pass filter is supplied to a comparator 568 
where it is converted to a logic signal. The logic signal from the comparator is used to drive a local phase locked loop 
(PLL) circuit. A phase locked loop (PLL) is an electronic circuit that controls an oscillator so that it maintains a constant 
phase angle relative to a reference signal. In this case the received high frequency clock signal is the reference signal. 
The PLL circuit generates a local audio reference clock which is used for reproduction of the DSD audio data. 
[0051] Figure 8 schematically illustrates the signal path of the 64Fs DSD sample clock signal. As explained above, 
the DSD sample clock is transmitted in both directions via dedicated differential signal pairs in the category 5 UTP 
interconnection cable 515. The sequence of processing operations performed on the high frequency (64F S ) clock signal 
will now be described with reference to Figure 8. Special analogue conditioning of the sample clock signal is performed 
to facilitate its transmission on a signal pair of the UTP cable adjacent to the asynchronous data signal. The analogue 
conditioning reduces the severity of electromagnetic interference effects from the asynchronous data signal (or from 
external sources) which compromise the integrity of the high frequency sample clock signal. As schematically illustrated 
in Figure 8, the sample clock processing that occurs in the clock master system involves the low pass filter 552, the 
differential line driver 554 and the transformer 556. The sample clock processing chain in the clock slave system 
involves the transformer 562, a high pass filter 563 and the comparator 568. 

[0052] The input to the low pass filter 552 of the clock master is a 2,8224MHz (64Fs) logic signal 551 . The frequency 
tolerance of this signal is in accordance with the Grade 2 specification defined by the standards document AES1 1 -1 997. 
Accordingly the sample clock has a long-term frequency stability of +/-10 parts per million (ppm), with an external 
synchronisation range of +/-50ppm. The duty cycle of the sample clock in the range 40-60%. and a Low Voltage Tran- 
sistor-Transistor Logic (LVTTL) logic signal is used. 

[0053] The 64 Fs logic clock signal 569 output by the comparator 568 of the clock slave system is also a logic signal 
of frequency 2. 8224MHz (64Fs). This clock output signal 569 is not used to synchronise any digital audio components 
directly because the link 515 characteristics may well have introduced substantial jitter and asymmetry to the clock 
signal. Rather, the clock output signal is used exclusively to synchronise an edge-triggered phase locked loop (PLL) 
in the receiver system. The clock output signal 569 is carefully routed within the receiver to ensure that any noise and 
jitter on the signal does not couple into other high-quality clock signals. The PLL circuit (not shown) of the clock slave 
system is used to generate high quality audio clock signals for distribution throughout the receiving system. 
[0054] The low pass filters 552 , 564 in both the transmitting (clock master) system and receiving (clock slave) system 
are second-order low-pass Butterworth filters, each having a cut-off frequency fc=2.9MHz. 

[0055] The transmitter low-pass filter 552 attenuates high-frequency components of the clock signal that may oth- 
erwise cause interference with the adjacent audio data signals in the cable or cause excessive RF emissions from the 
cable. The receiver low-pass filter 564 on the other hand, removes high-frequency interference from the clock signal 
induced by either the adjacent high-frequency data signals or by external sources. 

[0056] The differential line driver 554 located in the transmitter generates a symmetrical output signal of differential 
peak-peak voltage 1 .5V-2.5V into 100 Ohms (the impedance of the category 5 UTP link). 

[0057] The transformers 556, 562 in both transmitter and receiver are 10Base-T Ethernet transformers having a 1 : 
1 turns ratio and line-side common mode chokes. 

[0058] The high-pass filter 563 in the receiver is a first-order high pass filter having a cut-off frequency fc = 500Hz. 
This filter removes low-frequency interference from mains supply sources, and blocks DC offset. This filter is imple- 
mented with a simple resistance- capacitance (R-C) combination. 

[0059] The comparator 568 in the receiver converts the filtered analogue clock signal from the low pass filter 564 
into a logic signal. In order to avoid or reduce noise-induced multiple edges a 2% hysteresis is used. 
[0060] Figure 9 shows an embodiment of the invention in which the synchronisation of the physical layer device is 
adjusted so it is an exact multiple (9*64F S ) of the audio sample clock frequency 64F S . The Ethernet standard specifies 
a 25MHz symbol rate for data transmission. 

[0061] It is conceivable that transmission of the 2. 8224MHz sample clock along the same category 5 UTP as a 
asynchronous 25Mhz audio data signal could result in undesirable degradation of the audio clock. Synchronising the 
audio data transmission with the sample clock may help to reduce the degradation of the high-quality audio clock 
signal. The apparatus shown in Figure 9 comprises a multiplier 572 which takes a 64F S clock signal as input and up- 
converts it in frequency by a factor of 9 using a phase locked loop. The output from the x9 multiplier 572 is input to 
the PHY device of the transmitter so that a 576F S (25.401 6MHz) audio data signal is generated. Accordingly, this 
embodiment uses a 25.401 6MHz symbol rate for audio data transmission rather than the standard 25MHz Ethernet 
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symbol rate. As a consequence of the increased symbol rate the channel bit rate increases from 100Mbit/s to 
101.6064Mbit/s. 

[0062] Therefore, this embodiment of the invention can potentially reduce degradation of the audio clock signal but 
this is at the expense of removing compatibility with the 25MHz symbol rate of standard Ethernet systems. 
[0063] Figure 10 schematically illustrates a point-to-point audio link in which one device acts as a clock master 600M 
whilst the other device acts as a clock slave 600S. Each of the audio processing devices comprises a clock source 
PLL 602M/ 602S, a clock receiver (Rx) 604M / 604S, a lock detect module 606M / 606S, a clock transmitter (Tx) 608M 
/ 608S, an audio input/output (I/O) system 61 0M / 61 OS and a switch 61 2M / 61 2S. The suffix M denotes a component 
associated with the master device 600M whereas the suffix S indicates a component associated with the slave device 
600S. DSD audio data passes along a UTP cable (not shown) which links the audio I/O system 61 0M of the master 
with that of the slave 61 OS. 

[0064] The category 5 UTP cable provides independent connections such that under normal operating conditions 
clock signals are transferred in both directions between two audio devices. However in an active link one of the devices 
must be designated clock master 600M and the other device is thus designated the clock slave 600S. The clock master 
transmitter 608M sends an audio clock signal 605M to the clock receiver 604S of the clock slave. The master clock 
signal 605M is used by the phase locked loop 602S of the slave to produce a synchronisation signal that is supplied 
to the slave audio I/O system 61 OS. The audio clock signal 605S that is sent from the slave transmitter 608S to the 
clock receiver of the master 604M is not supplied to the phase locked loop 602M of the master because the switch 
61 2M of the master is left in an open state. However the slave clock signal 606S is compared with the local master 
clock by the lock detect module 606M of the master device to detect synchronisation of the remote slave system. 
[0065] Figure 11 is a flow chart which illustrates the sequence of events followed to establish a synchronised link 
between the master device and the slave device of Figure 10. 

[0066] At stage 620 the transceiver of device B 600S is set to slave mode and the clock transmitter 608S is temporarily 
disabled (until the link is established and a lock state has been achieved). This acts as a safeguard against two slave 
devices attempting to synchronise each other with unpredictable consequences. 

[0067] At stage 630 the UTP cable is used to physically connect the master device 600M to the slave device 600S 
thereby establishing the link. On connection of the cable both the master device 600M and the slave device 600S 
detect that the link is currently valid. The master device begins transmitting the clock signal 605M but the slave device's 
clock transmitter 608 is temporarily disabled. 

[0068] At stage 640 the slave device's clock receiver 604S detects the incoming master clock signal 605M and feeds 
this to the local slave phase locked loop circuit 602S which locks to the incoming master clock signal. 
[0069] At stage 650 the slave device 600S detects the lock condition by comparing its local system clock with the 
incoming master clock signal 605M via the lock detect module 606S. Closing the switch 61 2S completes the circuit 
between the slave PLL 602S the slave clock receiver 604S and the slave lock detect module 606S and thus enables 
lock detection. Once the slave lock detect module 606S signals that lock with the master clock has been established, 
the slave clock transmitter 608S is switched from the disabled state to an enabled state and the slave device 600S 
audio buffers (located in the audio I/O system 61 OS) are reset. 

[0070] At stage 660 the master device clock receiver 604M receives the echoed clock signal from the recently enabled 
slave clock transmitter 608S and checks the phase of this echoed signal to verify that the slave device has synchronised 
correctly with the master clock signal 605M. If synchronisation has not been correctly established then audio trans- 
mission is not enabled. 

[0071] At stage 670, having established that the slave device is correctly synchronised the master device resets its 
audio buffers (located in the audio I/O system 61 0M) and enables audio data transmission, whereupon framed DSD 
audio data is sent along the UTP cable linking master and slave devices. 

[0072] The flowchart of Figure 1 1 describes the standard process of establishing synchronisation between the master 
device and the slave device. However, it may be the case that an attempt is made to establish a link between two audio 
devices, both of which have been set to slave mode. In this event, the clock transmitters of both devices are disabled 
at the point where the devices detect a valid data link and an indication is made to the operator that the link is not 
synchronised. The link conditions are indicated to the user via LED status indicators (not shown) located adjacent to 
the RJ45 cable connection ports. Table 2 below gives an LED status for each of a number of possible link conditions. 
In particular a red or yellow LED "on" status corresponds to a clock synchronisation failure of the type that would be 
encountered during an attempt to link two slave mode audio devices. 



Table 2 



LED status 


Condition 


No LED on 


No Ethernet PHY connection detected 
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Table 2 (continued) 



LED status 


Condition 


Red (or yellow) LED on 


Ethernet PHY connection detected, but clock synchronisation failed/not present/not 
locked. Audio transfer inhibited 


Green LED on 


Ethernet PHY connection detected, slave device has locked to master device clock, and 
link is active 


Both LEDs on 


(illegal indication) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



[0073] Figure 12 schematically illustrates an apparatus in which multiple parallel links are used between two pieces 
of audio equipment. Use of multiple links means a higher channel count is achieved than that achievable via a single 
point-to-point link. In this case two links are used to provide a total of 64 channels. A transmitter device 700A comprises 
a first transmitter 702, a second transmitter 704 and a clock generator 706. A receiver device 700B comprises a first 
receiver 712, a second receiver 714 and a clock generator 71 6. A first category 5 UTP cable 721 carries audio data 
channels 1 to 32 (or 1 to 24) and links the first transmitter 702 to the first receiver 712. A second category 5 UTP cable 
723 carries audio data channels 33 to 64 (or 25 to 48) and links the second transmitter 704 to the second receiver 71 4. 
[0074] When operating the apparatus of Figure 1 2, it is necessary to ensure that the DSD audio data streams output 
by the first receiver 71 2 are sample-synchronised with the DSD audio data streams output by the second receiver 71 4 
i.e. the samples from channels 1 to 32 (or 1 to 24) are synchronised with the samples from channels 33 to 64 (or 25 
to 48). The transmit and receive latencies of the PHY devices in the transmitters 702, 704 and in the receivers 712, 
714 mean that it is possible that the output of receivers 712, 714 could slip out of synchronisation by more than one 
DSD audio sample period (3.543 x1 0" 7 seconds). Manufacturer specifications for commonly used PHY devices indicate 
that combined transmit and receive latencies of the PHY devices could vary by up to 6x1 0 -8 seconds so that slippage 
of one DSD sample between receivers is conceivable. Any differences in the lengths of cables 721 and 723 will also 
affect synchronisation. 

[0075] As shown in Figure 1 2, the first and second transmitters 702, 704 of the transmitting audio system 700A use 
a common synchronisation reference clock signal F S (A) running at F s = 44.1 kHz. Similarly the first and second receivers 
712, 714 of the receiving audio system 700B use a common synchronisation reference clock F S (B) running at F s = 
44.1kHz. These two 44.1kHz synchronisation clock signals F S (A) and F S (B) have identical frequencies both having 
been derived from a 64Fs master clock signal, but their phases, being arbitrary, are unlikely to match. The arbitrary 
phases are due to F S (A) and F S (B) having been derived from the common 64Fs clock via independent clock dividers. 
The flow chart of Figure 13 illustrates how the signals F S (A) and F S (B) are employed to ensure that the outputs of 
receivers 712 and 714 (which have derived their audio data from separate link cables 721 and 723 respectively) are 
kept synchronous. 

[0076] At stage 730 of the flow chart of Figure 1 3, a communication link between the transmitting system 700A and 
the receiving system 700B is established. Each of the two transmitters 702, 704 awaits receipt of a clock edge from 
the local 44.1 kHz clock signal F S (A) and then transmits the first audio frame. The data frame is packed such that the 
first DSD sample is input synchronously with the clock edge. The flow chart of Figure 13 relates to an embodiment in 
which there are 32 channels of DSD audio. As shall be described in detail below with reference to Figure 1 8A, for the 
32-channel system each frame comprises 384 data words and words 1 3 to 382 each contain a 1 -bit DSD sample value 
for each of 32 channels (370 sample values per channel are contained in each frame). The first transmitter transmits 
the first audio frame corresponding to channels 1 to 32 whilst the second transmitter transmits the first audio frame 
corresponding to channels 33 to 64. Since in this embodiment each frame contains 370 samples and there are 64 
samples per Fs period, a coincident frame start (1 st DSD sample value output) and Fs-period start (Fs(A) clock edge) 
will occur every 370 x 64 samples. However, 370 and 64 have a common factor of 2 so a frame-start and F s period- 
start occur together every (370*64)/2 samples i.e. every 32 frames. Accordingly, the 1 st DSD sample value of the frame 
will be output synchronously with the local F S (A) clock edge for frames 1 , 33, 65, 97.. and so on. These particular 
frames have a specific bit flag in a "frame type" field (see Figure 16) of the data frame set to one. 
[0077] At stage 732 of the flow chart both the first receiver 712 and the second receiver 714 capture a phase count 
value (j = 1 or 2 corresponding to first and second receivers respectively) marking the point in time at which the first 
DSD sample value in the first received frame is ready for output. Note that at system start-up the receiver audio outputs 
are muted and transmitter audio outputs are only enabled once synchronisation of the 64Fs sample clocks has been 
verified by the master device. The time at which the receiver is ready to output the first DSD sample value will depend 
on the time taken for the slave device to achieve phase lock with the 64F S clock signal of the master device. It will also 
depend on the setting of the threshold level of a FIFO buffer of the particular transmitter. Each receiver derives the 
phase count value 4>j from a counter in the receiver which is clocked by the 64 F s local clock signal and reset by the 
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44.1kHz signal F S (B). 

[0078] At stage 734, a system controller (not shown) compares the phase count values, 0^ and <D 2 ,for each of the 
receivers and determines if they are identical. If & A = 4> 2 then the receivers are synchronised to within the same DSD 
sample period which is the desired condition. In this event the process proceeds to stage 738 where the audio outputs 

5 are unmuted. If however, <I> 1 * <J> 2 at stage 734 then the process proceeds to stage 736 where the system controller 
adjusts the buffer read positions of the receivers in an attempt to achieve synchronisation. The receiver that synchro- 
nised with the 64Fs master clock earliest (and hence received DSD audio data first) has its buffer read position adjusted 
to match the buffer read position of the latest synchronised receiver (which started to receive DSD data later). This 
buffer read position adjustment is equivalent to modification of the phase count values Oj such that they are both equal 

10 to the higher of the two compared phase counts. Only when synchronisation has been achieved i.e. when the phase 
count values of the receivers are identical will the audio outputs be enabled. 

[0079] The phase count values of the receivers are cross-checked for every flagged frame (first frame and every 
following 32 nd frame) to ensure that synchronisation of the receivers is maintained. Frames are transmitted every 
1 31 .25u,s so that flagged frames occur approximately every 4.2ms (32x 1 31 .25u.s). Any receiver synchronisation prob- 

15 lem should be detectable and correctable within this 4.2ms period. Stages 742, 744, 746, of Figure 13 show the check 
that is performed by the system controller for every flagged frame. At stage 742 the controller checks the modified 
phase count value for the current flagged frame and compares it with the final (possibly modified) recorded phase 
count value for the previous flagged data frame i.e. frame X-32. If the phase count values match then the system 
continues with audio data transmission at stage 746. If however the phase count values for the two flagged frames do 

20 not match, this indicates that the two receivers are not outputting the same audio sample value simultaneously and 
the process proceeds to stage 744 where the system controller initiates resetting of the data links in an attempt to 
restore proper synchronisation. When the data links are reset the receiver logic is put in a reset condition so that the 
process of stages 732 to 738 of Figure 11 is carried out. In alternative embodiments the data links are reset by adjust- 
ment of the buffer read positions, but in this case a buffer overrun/underrun would trigger a total reset of the link. Sample 

25 synchronisation slippage could occur, for example, due to a cable glitch. 

[0080] For the alternative 24 DSD channel embodiment, as shall be described in detail below with reference to Figure 
18B, each frame comprises 368 data words and words 15 to 366 contain 352 DSD samples for 24 channels plus 88 
bytes of auxiliary data. Each 32-bit sample comprises 1 -bit from each of the 24 DSD channels, 2 bits of auxiliary data 
and 6 check-bits. Bit 0 of each sample corresponds to the first logical audio channel whereas bit 23 corresponds to 

30 the 24 th logical audio channel. In this case the first transmitter transmits the first audio frame corresponding to channels 
1 to 24 whilst the second transmitter transmits the first audio frame corresponding to channels 25 to 48. Since in this 
embodiment each frame contains 352 samples and there are 64 samples per Fs period, a coincident frame start (1 st 
DSD sample value output) and Fs-period start (Fs(A) clock edge) will occur every 352 x 64 samples. However, 352 
and 64 have a common factor of 32 so a frame-start and F s period-start occur together every (352*64)/32 samples i. 

35 e. every alternate frame. Accordingly, in the 24 DSD channel embodiment the 1 st DSD sample value of the frame will 
be output synchronously with the local F S (A) clock edge for frames 1,3, 5, 7, 9.. and so on. It follows that every alternate 
frame will be a flagged frame and the phase count values of the receivers will be cross-checked every alternate frame. 
[0081] Figure 14 schematically illustrates how audio data buffering is performed in the transmitter. The buffering 
apparatus 800 comprises a First In First Out (FIFO) buffer 810 in series connection with a frame assembler 820. In 

^o operation, 32 channels of Direct Stream Digital 1 -bit sample data are continuously fed into the FIFO buffer at a rate of 
64Fs which corresponds to 90.31 68Mbit/s. When the occupation level of the FIFO buffer reaches a predetermined 
threshold level 815 a signal is generated by the system controller to initiate transmission of a new audio data frame. 
In response to this signal, the frame assembler assembles the frame preamble and headers, during which time incoming 
DSD samples continue to be buffered. As soon as the audio data payload assembly begins, the frame assembler starts 

45 to extract data from the FIFO. The rate at which data is extracted from the FIFO corresponds to the Ethernet trans- 
mission rate of 100 Mbit/s (or 1 01 .6064Mbit/s for embodiments in which the symbol rate is locked to 9* 64F S ). Since 
the FIFO is filling at a rate of 90.31 68Mbit/s and emptying at a rate of 100Mbit/s the net buffer occupation level will 
steadily decrease during this period. The predetermined threshold level 815 is set in dependence upon the data input 
rate, the data output rate and the frame size (370 1-bit samples for 32 channels) so that the buffer occupation level 

50 will be almost, but not quite, zero at the end of each frame transmission i.e. data from the next frame for transmission 
is present in the buffer. The fact that the transmitter buffer 81 0 is not completely empty by the time the frame transmission 
ends breaks the rules of the MAC. Once the frame transmission is complete the FIFO occupation level will increase 
rapidly until the threshold level is reached whereupon the frame transmission cycle will repeat. 
[0082] For a transmission system with an input data rate of 90.31 68Mbit/s, an output rate of 1 01 .6064Mbit/s and a 

55 (370 1-bit sample) (32 channel) frame capacity it can be shown that the minimum buffer size is 42 DSD samples and 
the corresponding minimum threshold level is 30 DSD samples. The audio latency introduce by this minimum size 
buffer is 14.9ui> (= 42 / 64Fs). 

[0083] Figure 15 schematically illustrates how audio data buffering is performed at the receiver. The receiver buffering 
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apparatus comprises a frame receiver 860 in series connection with a FIFO buffer 870. Audio data arrives (via the 
category 5 UTP cable) in framed format at the frame receiver 860 at a rate of 1 0OMbit/s (or 101 .6064Mbit/s for the 
9*64F S symbol rate). The frame receiver strips off the preamble and headers of each data frame and optionally performs 
a cyclic redundancy check (CRC) to verify the integrity of the received data. Unframed audio data is passed directly 

5 from the frame receiver 860 to the FIFO buffer 870. Audio data extraction from the FIFO starts immediately since there 
is no threshold level set in the buffer at the receiver. This ensures that near-zero receiver latency is achieved. The 
audio data frames contain a cyclic redundancy check word (CRC). The CRC algorithm, check word location and scope 
are as defined in IEEE802.3-2000 section 3.2.8. This 32-bit check word will generally detect any error within the frame. 
In known Ethernet systems a CRC is performed on each frame both at the transmitter and at the receiver. At the receiver 

10 complete frames are output only once the result of the CRC on that frame is determined. This results in substantial 
latency before the data is output at the receiver in known systems. According to the present technique, although the 
CRC check is still performed at the receiver, data is output from the buffer before the result of the CRC check is obtained. 
Error control is performed by decoding parity bits at a stage subsequent to data output at the receiver FIFO. In particular, 
error control is performed when data is extracted from the 32-bit data blocks prior to output as a 32 DSD channel audio 

15 stream. Unlike standard Ethernet systems, the MAC-DSD protocol according to the present technique does not support 
frame re-transmissions in case of an error, as this would require buffering of at least two 1 25 microsecond audio frames, 
increasing system latency to an unacceptable degree. Although the primary purpose of the IEEE802.3 CRC is to detect 
frame errors and thereby generate a retransmission request, the CRC is included for sake of compatibility. It will be 
appreciated that support for CRC-initiated MAC-DSD frame retransmission may be provided for applications requiring 

20 greater robustness at the expense of latency. Audio data is extracted from the FIFO at a continuous rate of 90.31 68Mbit/ 
s and because the data output rate is less than the data input rate, the FIFO gradually fills up as the frame is received. 
Once a complete frame has been received there will be an inter-frame latency time before reception of audio data from 
the next frame and the FIFO buffer will continue to empty (although not completely) during this idle period. 
[0084] In the event that the receiver buffer fills completely or empties completely an error signal will be sent to the 

25 system controller. In this event the system controller will mute the audio outputs because a completely full or empty 
buffer indicates that one of the following situations has arisen: data link has failed; transmitter has failed; or DSD master 
clocks have not been properly synchronised between transmitter and receiver. 

[0085] Figure 16 schematically illustrates the data structure of a standard Ethernet frame. The frame structure is 
defined in the IEEE 802.3 standard. As shown in Figure 16 the Ethernet frame comprises a preamble, a start frame 

30 delimiter, a destination address field, a source address field, a data length field, a data payload and a checksum. 
[0086] The preamble is 7 bytes long, each byte containing the bit pattern 10101010 and this is followed by a single- 
byte start frame delimiter S containing the bit pattern 10101011. The preamble and start frame delimiter are used for 
hardware timing purposes. The destination address field is 6 bytes long and specifies the physical address of the 
network adapterthat is to receive the frame. The source address field is 6 bytes long and contains the physical address 

35 of the network adapter that is sending the frame. The data length field is 2 bytes long and specifies the size of the data 
payload. The data payload is a variable length field which is a minimum of 46 bytes and a maximum of 1500 bytes 
long. The checksum field is 4 bytes long and contains a checksum value for the frame that is used to perform a cyclic 
redundancy check (CRC). The CRC is a common means of verifying data transmissions. The sending network node 
calculates a CRC value for the frame according to a predetermined algorithm and encodes it in the frame. The receiving 

40 network node then recalculates the CRC and checks the CRC field to see if the values calculated by the transmitter 
and the receiver match. Ifthevaluesdonot match this indicates that data has been lost or corrupted during transmission. 
This Ethernet frame will be passed to the Physical layer components where it will be converted to a bit stream and 
sent across the transmission medium. Note that slight variations of this Ethernet frame format exist. 
[0087] Figure 1 7 shows the structure of an audio data frame according to an embodiment of the present invention. 

45 The audio data frame has a total size of 1 536 bytes comprising: an 8 byte preamble (following which the physical layer 
will accept up to 1528 bytes of arbitrary data); a 6-byte field reserved for the destination MAC address (default value 
Oxffffff); a 6 byte field reserved for the source MAC address (default value 0x000000); a 2-byte data length field which 
specifies the number of bytes (always 1510 bytes) following this field but excluding the CRC; a 28-byte field reserved 
for networking headers; a 12-bit reserved field (as yet unallocated); a 4-bit frame type field which is used for example 

so for synchronisation purposes; an audio data payload of 1 480 bytes which holds 370 samples of 32 channel DSD audio; 
and a 4-byte CRC field containing a checksum. The CRC checksum procedure used in embodiments of the invention 
will be described below. The audio data frame structure illustrated in Figure 1 7 is of a form that allows for compatibility 
with Internet Protocol (IP) networks. Accordingly the audio data frame may be treated as a User Datagram Protocol 
(UDPyiP datagram for transmission over wider IP networks. UDP is a connectionless (best try) transport layer protocol. 

55 |n this particular embodiment only the physical layer is used. The MAC layer is not used so the MAC address fields 
are not actually required by the system. These fields are simply reserved and filled with default values to allow (potential 
later) compatibility with Local Area Networks (LAN) or UDP/IR 

[0088] The audio frame CRC validity check will now be described in more detail. All frames use a 4-byte CRC check 
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word, to verify the validity of the frame. The CRC algorithm, check word location and scope are similar to those defined 
in the standards document IEEE802.3-2000 section 3.2.8. 

[0089] According to the IEEE802.3 standard, the payload of a frame should not be passed on from the data link layer 
until the frame validity has been verified with the CRC. However, in the context of embodiments of the invention, this 
5 implies that the receiver would have to buffer an entire frame before starting to output the DSD audio bitstreams. Direct 
implementation of this standard would be undesirable, as it would increase the audio latency by 115u.s, from around 
25jis to 140u.s. 

[0090] The CRC is primarily used to check the validity of a data link between audio devices at system start-up. Link 
failures after start-up, such as a cable disconnection are indicated by a receiver error assertion from the PHY device, 
10 following which the audio output is muted. Since the link is a simple point-to-point connection, with deterministic, syn- 
chronised frame transmission and no collisions, other modes of failure are unlikely. 

[0091] Accordingly, a relatively simple CRC check is implemented in embodiments of the invention. The receiver 
audio outputs are muted on start-up, until the first received frame has been received in full and verified by its CRC. If 
the CRC check fails, the audio outputs remain muted, and an error condition indicated to the local system controller. 
*5 Following the verification of the first frame, the CRC is only be checked retrospectively. This allows audio data to be 
streamed out with near-zero receiver latency. The CRC is used only to alert a host processor that a CRC error has 
occurred. 

[0092] If an invalid audio data frame is encountered, it is theoretically possible for up to 131u.s of invalid audio data 
to pass, before the output is muted in response to the retrospective CRC test. However, in practice, a random external 
20 perturbation that corrupts PHY line symbols will cause invalid symbols, resulting in rapid assertion of a receiver error 
condition, which may be detected to mute the audio outputs. 

[0093] If use of a CRC check on every frame is considered necessary then each frame is buffered and verified using 
the CRC before outputting the DSD audio data. This is not a preferred option because it adds approximately 115u.s 
extra latency and substantially increases the receiver buffer hardware size. 

25 [0094] The 1 536-byte audio data frames illustrated in Figure 1 7 each have a transmit duration of 1 20.9u.s (at a symbol 
rate of 101 .6064Mbit/s). According to a particular embodiment of the invention, frames are transmitted at intervals of 
131.1u.s. A minimum inter-frame time of 96 bit periods is provided which leaves 8.25us of "link-time" between trans- 
mission of audio frames. This link-time is used to convey auxiliary frames containing control data. The maximum total 
size of a control data frame in this embodiment is 104 bytes. 

30 [0095] The structure of a control data frame is identical to that of the audio data frame shown in Figure 1 5, with the 
exception of the length of the data payload which is 1 480 bytes for the audio data frame but only 48 bytes for the control 
data frame. A control data frame is transmitted every 1 31 \is which provides a control data bandwidth of 2.9Mbit/s. The 
control data itself may comprise channel usage information, router control data and clock source control data. The 
control data will be transmitted from storage in a FIFO buffer at the transmitter and gathered in a FIFO buffer at the 

35 receiver before being routed to a system controller of the receiver. 

[0096] Figure 18A shows the audio data frame format for the 32 DSD channel embodiment which is arranged as 
384*4-byte data words. Similarly, Figure 1 9 shows the control data format for the 32 channel DSD embodiment arranged 
as 26*4-byte data words. In both Figure 1 8A and Figure 1 9, bit zero (B0) is transmitted first and bit31 (B31 ) is transmitted 
last. These audio data frames and control data frames are passed to and received from the Media Independent Interface 

40 (Mil) connection 218 that provides a link to the Ethernet physical layer devices. The Mil comprises a 4-bit wide transmit 
data bus and a 4-bit wide receive data bus each of which is clocked from the PHY at the link rate of 25MHz (or 
25.4016MHz). The Mil also has a transmit-enable signal input to initiate data transmission and a receive data valid 
signal output as well as other error and signal status indicators. 

[0097] Referring now to the audio data frame structure illustrated in Figure 1 8A it can be seen that the payload of 
45 the audio data frame contains 370 samples of 32-channel 64Fs DSD audio. These channels are multiplexed per-bit. 
Each 32-bit word represents one 64Fs DSD sample for 32 audio channels. Word 13 is the first DSD sample in the 
frame, and word 382 is the last. Bit 0 of an audio data word is always the single-bit sample data for channel 1 (the first 
channel in the system) whereas Bit 31 of an audio data word is always the single-bit sample data for channel 32 (the 
last channel in the system). Table 3 below indicates how successive samples for each channel are stored in the data 
so words of the audio frame. For example: bit 0 of word 13 is the channel 1 sample data, for the first DSD sample in the 
frame; bit 6 of word 14 is the channel 7 sample data, for the second DSD sample in the frame; and bit 31 of word 382 
is the channel 32 sample data, for the last DSD sample in the frame. 



Table 3 



Word 


Bit 31 


Bit 30 


• • « 


Bit 1 


BitO 


13 


Ch. 32, sample 1 


Ch. 31 , sample 1 


• ■ ■ 


Ch. 2, sample 1 


Ch. 1, sample 1 
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Table 3 (continued) 



Word 


Bit 31 


Bit 30 




Bit 1 


BitO 


14 


Ch. 32, sample 2 


Ch. 31 , sample 2 




Ch. 2, sample 2 


Ch. 1, sample 2 


» a • 


• • ■ 


• ■ • 


• ■ > 




■ ■ ■ 


382 


Ch.32, sample 370 


Ch. 31 , sample 370 


i ft ■ 


Ch. 2, sample 370 


Ch. 1, sample 370 



10 



15 



[0098] Although Table 3 above represents the frame format in 32-bits words, these are supplied to and from Ml I four 
bits (a nibble) at a time rather than a word (4-bytes) at a time. The sequence of nibbles supplied to the Mil for the single 
24 DSD channel frame of Figure 18B is as shown in Table 4 below. The start of the 14 th data 4-byte word (word 13) 
corresponds to the start of the 105 th 4-bit nibble (nibble 104). The column headings TXD and RXD in the table below 
refer to the MM transmit and receive data buses respectively, which transfer nibbles of data synchronously with a 25MHz 
(or 25.4016MHz) clock. 

[0099] Nibble 0 is the first nibble in the frame, and contains part of the preamble pattern (0x5). Nibble 104 is the first 
nibble of the audio data field (first nibble of word 13), and nibble 3063 is the last nibble of the audio data field (last 
nibble of word 382). 



Table 4A 



20 



nibble 


TXD(3)/RXD(3) 


TXD(2)/RXD(2) 


TXD(1)/RXD(1) 


TXD(0)/RXD(0) 


0 


0 


1 


0 


1 


1 


0 


1 


0 


1 


■ ■ i 


■ • * 


• • • 


in 


• • • 


104 


channel 4 sample 1 


channel 3 sample 1 


Channel 2 sample 1 


channel 1 sample 1 


105 


channel 8 sample 1 


channel 7 sample 1 


Channel 6 sample 1 


channel 5 sample 1 


106 


channel 12 sample 1 


channel 11 sample 1 


Channel 10 sample 1 


channel 9 sample 1 


• • • 


ft 4 ft 


» • • 




■ ■ • 


111 


channel 32 sample 1 


channel 31 sample 1 


Channel 30 sample 1 


channel 29 sample 1 


112 


channel 4 sample 2 


channel 3 sample 2 


Channel 2 sample 2 


channel 1 sample 2 


• ■ * 


■ ■ ■ 




• ■ « 




3062 


channel 28 sample 370 


channel 27 sample 370 


Channel 26 sample 370 


channel 25 sample 370 


3063 


channel 32 sample 370 


channel 31 sample 370 


Channel 30 sample 370 


channel 29 sample 370 



25 



30 



35 



40 



45 



50 



[0100] Figure 18B schematically illustrates the audio data frame format for the 24 DSD channel embodiment. In this 
case the frame comprises 368*4-byte data words. The payload of the audio data frame comprises 352 DSD samples, 
each sample comprising 1-bit from each of the 24 channels. Data words 15 to 366 contain the audio data payload. 
Words 2 to 4 are reserved for source and destination MAC addresses. Bits 0 to 1 5 of word 5 specifies the total number 
of bytes in the frame from the beginning of the length field onwards but excluding the CRC field, which in this case is 
1 446 bytes. Bits 1 6 to 31 of word 5, words 6 to 12 and bits 0 to 15 of word 1 3 are data fields reserved for UDP and IP 
parameters. These data fields facilitate optional use of UDP/IP. When UDP/IP operation is not required, the transmitter 
fills these fields with zeros. The receiver may ignore all these UDP/IP header fields, with the exception of the first four 
bits (bits 16 to 19 of word 5 in this case) which indicate the IP Version. The data entry in the IP version field is checked 
and an action is taken in correspondence with the determined value as specified in Table 5 below: 

Table 5 



IP Header Value 


Consequent Action 


0x0 


Process frame as normal (i.e. transmitter did not fill IP fields) 


0x4 


Process frame as normal (i.e. transmitter filled frame header fields according to IP version 4) 


any other 


Discard the frame 



55 
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[0101] The IP Version check is performed to ensure backwards compatibility of the current IP version 4 from future 
IP versions (i.e. IP version 6). Future IP versions may have different header lengths, and consequently the Frame 
Format ID fields may be located at a different position in the frame. The safeguard of checking the IP version field 
means that such a frame would be discarded by the receiver (due to having a value other than 0x0 or 0x4) which avoids 
the possibility of the frame being incorrectly interpreted due to the Frame Format ID fields not being in the expected 
location at words 13 and 14. 

[0102J Bits 16 to 31 of word 13 and bits 0 to 31 word 14 in Figure 18B are fields for specifying the MAC-DSD frame 
format. This 48-bit frame format field is logically divided into three distinct 16-bit (4-nibble) sections, each of which 
contains an identical set of frame format data on transmission. The same set of frame format data is repeated three 
times within a given frame to ensure that the frame format identifier is robust to transmission errors i.e. multiple copies 
of the data are sent to serve as an error protection mechanism. This data-repeat error protection mechanism has the 
advantage that it gives the required error correction capability given that 48 bits are available to convey 16 bits of 
information yet it is simple to implement. An alternative embodiment might use an error correction code such as a 
convolutional code to transmit the frame format ID payload. 

[0103] Each of the three 16-bit frame format field sections are structured as illustrated in Figure 20. The first nibble 
(bits 0-3) of each 16-bit section specifies the Protocol Minor Version (OxO-Oxf). The protocol minor Version field is 
used to indicate minor updates to the protocol specification. A more recent Minor Version should be fully backwards- 
compatible with a previous Minor Version associated with the same Major Version so that for example a Version 1 .7 
protocol must incorporate all the functionality of Version 1 .6 protocol, and a Version 1 .7 transceiver must be able to 
communicate fully with a Version 1.6 transceiver. The second nibble (bits 4-7) of each 16-bit section specifies the 
Protocol Major Version (OxO-Oxf), This field is used to indicate major updates to the protocol specification. Backwards- 
compatibility with previous Major Versions of the protocol is desirable but not mandatory. The third nibble (bits 8-11) 
of each 16-bit section specifies the Frame Type (OxO-Oxi). This field can be used to indicate different frame types 
used by a given version of the protocol. Within a given Major Version level, the definitions of frame types should be 
consistent. The basic type of audio frame is always Type 0. The table below specifies the information derivable from 
the Frame type number specified by bits 8 to 11 according to the described embodiment. 



Table 6 



Frame Type Number 


Name 


Description 


0x0 


DSD audio frame 


352 DSD (2.8224MHz) samples, 24-channel, plus 88 bytes aux data, 
(32,26) Hamming linear block code error correction, 256-nibble 
interleaving 


other 


(invalid) 


Invalid — reject frame 



[01 04] The fourth nibble (bits 12-15) of each 1 6-bit section contains one or more flags used for example to flag frames 
for synchronisation purposes as described above with reference to the flow chart of Figure 13. The definition of the 
flag bits is dependent upon the Major Version protocol level. The table below specifies the information derivable from 
the frame flag bits 12-15 according to the described embodiment. In particular bit 0 of the flags field is the 44.1 kHzsync 
flag. If flag 0 has a value 1 this indicates that the first DSD sample in frame was received at transmitter simultaneously 
with 44.1kHz sync clock positive edge whereas if bit 0 of the flags field has value 0, this indicates that the first DSD 
sample in frame was not received at transmitter simultaneously with 44.1 kHz sync clock positive edge. 



Table 7 



45 


Flag bit 


Name 


Description 




0 


44.1 kHz sync flag 


1 : First DSD sample in frame was received at transmitter simultaneously with 
44.1 kHz sync clock positive edge 


50 






0: First DSD sample In frame was not received at transmitter simultaneously with 
44.1 kHz sync clock positive edge 




others 


(not used) 


Set to 0 by transmitter, ignored by receiver 



[0105] Figure 21 schematically illustrates the three 4-nibble sections of the frame format ID containing a set of data 
entries to be processed at the receiver. Section 0 comprises nibble 0 (nO) to nibble 3 (n4), section 1 comprises nibble 
4 (n4) to nibble 7 (n7) and section 2 comprises nibble 8 (n8) to nibble 11 (n11). The manner in which the repetition of 
data sections is used at the receiver to reject data transmission errors will now be explained in the context of Figure 
21 . According to the present technique it is known that on transmission, each of the three sections should contain an 
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identical data set such that data entries in corresponding nibble positions of each of the three sections match. On 
particular it is expected that: nO = n4 = n8; n1 = n5 = n9; n2 = n6 = n10; and n3 = n7 = n11 . At the receiver triplets of 
corresponding nibbles are compared for equality, and a majority decision is taken as to the correct data value. Consider 
the example incoming receiver data set shown in Figure 21 . For the first triplet of nibbles it can be seen that nO = 1 1 01 b, 

5 n4 = 1101b, n8 = 1101b i.e. the corresponding nibble values are identical so the value is assumed to be correct and 
the first nibble of the Frame Format, which specifies the protocol minor version, is set to the value 1101b. Similarly, for 
the second triplet of nibbles n1=n5=n9=1110b so the value is assumed to be correct and the second nibble of the 
Frame Format, which specifies the protocol major version, is set to 1 11 Ob. However, for the third triplet of nibbles there 
is a discrepancy between the data values since n2 = n1 0=011 Ob but n6 = 1011b. In this case n6 is rejected as being 

10 erroneous on the basis of a majority decision so that the receiver and outputs the third nibble of the Frame Format, 
which corresponds to the frame type, as 0110b. For the fourth and final triplet of nibbles it can be seen from Figure 21 
that none of the corresponding nibbles match n3 = 0010b, n7 = 0111b, n11 = 1100b. In this case a majority decision 
is impossible so the frame format cannot be determined and consequently the frame is rejected. 
[01 06] An alternative embodiment uses a modified Frame Format error detection/correction strategy. This alternative 

is strategy also involves using the data repetition and majority decision approach but the strategy is augmented by using 
the 1 0OBase-TX PHY 'MN'receive error 1 (rx_er) signal to flag nibbles that are known to be in error. For example consider 
receiving the following values for the fourth triplet of nibbles with associated error flags as indicated: n3 = 1 000b (rx_er 
= true), n7 = 0100b (rx_er = false), n11 = 1000b (rx_er = true). In this case, although the majority decision determines 
that 1000b is the correct value, the rx_er signal indicates that n3 and n11 are definitely incorrect. Thus according to 

20 this alternative strategy the data vale n7 is selected in preference to n7 and n11 to give a Frame Format Flags value 
of 0100b. . _ 

[0107] Returning now to the frame data fields of figure 18B. the last word (word 367) of the 24 DSD channel data 
frame is a field containing cyclic redundancy check (CRC) data. 

[0108] Table 4B below identifies the sequence of nibbles supplied to the Mil for the single 24 DSD channel frame of 
25 Figure 18B. This sequence is transmitted via the nibble-wide Mil interface 21 8, starting with the least significant nibble. 
Nibbles 0 to 8 (32 bits) correspond to word 0 of Figure 1 8B, nibbles 8 to 1 5 correspond to word 1 of Figure 1 8B, nibbles 
16 to 23 correspond to word 2 of Figure 18B and so on until the last nibble which corresponds to bits 28 to 31 of word 
366. There are a total of 2936 nibbles (367 words) corresponding to the 1446 byte frame of Figure 18B since the last 
word is not transmitted as a nibbles. As mentioned above with reference to Figure 1 the Mil 218 interface provides 
30 independent 4-bit wide data-transmit and data-receive paths and full duplex operation. More particularly, the Mil 218 
comprises: a four-bit wide transmit data bus, clocked from the physical layer interface (PHY) 514, 526 at the link rate 
(25MHz or25.4016MHz); a transmit enable signal input; four-bit (nibble) wide receive data bus, clocked from the PHY 
at the link rate (25MHz or 25.401 6MHz); a receive data valid signal output; and error and signal status indicators. A 
full description of the Mil interface, can be found in IEEE8O2.3-2000 Section 22, but note that the clock rate according 
35 to the present technique may be 25.401 6MHz rather than the IEEE standardised 25.0000MHz. 



Table 4B 



40 



45 



Nibble 


Word (from Figure 18B) 


MIITXD(3) 


MM TXD(2) 


MIITXD(1) 


Mil TXD(0) 


0 


0 


Bit 3 


Bit 2 


Brt 1 


BitO 


1 


0 


Bit 7 


Bit 6 


Bit5 


Bit 4 


« * * 


• » • 


» « ■ 


• • * 


• • • 




7 


0 


Bit 31 


Bit 30 


Bit 29 


Bit 28 


8 


1 


Bit 3 


Bit 2 


Bit 1 


BitO 




• • * 










2934 


366 


Bit 27 


Bit 26 


Bit 25 


Bit 24 


2935 


366 


Bit 31 


Bit 30 


Bit 29 


Bit 28 



[0109] The nibble is the fundamental unit of data carried on the physical layer Each 4-bit nibble is mapped to a 5-bit 
symbol by the PHY 514, 526, for transmission on the signal line 515. All frames for transmission must begin with an 
eight-byte preamble pattern, following which the physical layer will accept up to 1 528 bytes of arbitrary data, supplied 
55 4 bits at a time. Received frames are supplied 4 bits at a time by the receive bus, including the preamble. 

[0110] The 24 DSD channel frame format of Figure 18B includes a frame payload of 352 DSD samples, each of 
which consists of a 32-bit data block. Figure 22 schematically illustrates the format of the 32-bit data block. Each data 
block corresponds to a single DSD sample period of approximately 354ns. The data block comprises a 24-bit audio 
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data vector each bit of which belongs to a respective one of the 24 audio channels, 2 bits of auxiliary data and 6 check 
(or parity) bits. As shown in Figure 22 bit numbers 0 to 1 4 contain bits 1 to 1 5 of the audio data vector, bit numbers 1 5, 
23, 27,29,30 and 31 contain the six parity bits, bit numbers 26 and 28 contain the two bits of auxiliary data and the 
remaining nine bits of the audio vector are contained sequentially in bit numbers 1 6 to 22, 24 and 25 of the data block. 

5 [01 11] The six parity bits of the 32-bit data block provide error control capability. The 24-bits of audio data plus the 
two auxiliary bits (totalling 26 bits) are encoded using a type of linear block code known as a Hamming code. In this 
case a (31 , 26) Hamming code is used, which means that 5 (=31-26) parity bits are generated by the code for each 
group of 26 data bits. The final bit of the 32-bit block is a global parity bit so there are a total of 6 parity bits and 26 
data bits. The (31 , 26) Hamming code is capable to detecting 2 errors per data block but is only capable of correcting 

10 one error per data block. 

[01 12] Figure 23A schematically illustrates how the six parity bits PO to P5 are generated from the 24 audio data bits 
(numbered 1-24) and the two auxiliary data bits AO, A1. Parity bits PO to P5 are generated by performing a logical 
XNOR operation on a predetermined sequence of 15 data elements. For example PO is generated by performing an 
XNOR operation on audio vector bits 1 through 15 whereas P1 is generated by performing an XNOR operation on 

'5 audio vector bits 1 to 8 and 1 6 to 22. Global parity bit P5 is obtained by performing the XNOR operation on all 26 data 
elements. The error detection process at the receiver involves determining whether the parity checks are satisfied in 
the received data sequence. This is done using a value known as the syndrome. Figure 23B indicates how the syndrome 
s is generated by XNOR operations on various combinations of the received data block elements. The syndrome is 
obtained by comparing the received parity bits and the parity bits recalculated from the received information. Table 8 

20 below indicates how the value of the syndrome is usedto detect and correct errors in the received data block. Essentially, 
if all 6 bits of the syndrome have value 1 (8=111111) then the received data sequence is assumed to be correct. If the 
sixth bit of the syndrome is zero then there is assumed to be a single error in the received data block, which is correctable 
by inverting the appropriate bit. The appropriate bit is identified from the value of the syndrome itself e.g. if s = 011011 
in binary notation, which corresponds to the decimal number 27 then it is determined that bit number 27 (of bits 0 to 

25 31 ) should be inverted to correct the data block. If the sixth bit of the syndrome is 1 but the other five bits are not all 1 
e.g. s =1 1 1 01 1 then this indicates that there are two or more errors in the block and the multiple errors are uncorrectable. 



Table 8 



s 5 


S 4 S 3 S 2 S 1 S 0 


Block status 


1 


11111 


No errors in block 


0 


other 


One error in block, identified by s 4 s 3 s 2 s 1 s 0 — correct error by inverting bit 


1 


other 


More than one error in block — not correctable 



35 

[0113] The 32-bit data blocks (see Figure 22) are interleaved in groups of 32, to facilitate correction of groups of 
errors. The interleaving process involves permuting the data in a predetermined way. This is required because the (31 , 
26) Hamming code used for each 32 -bit data block is only capable of correcting a single bit error in a given block. Since 
the fundamental unit of data on the physical layer the four-bit data nibble, a single instantaneous corruption on the 
40 physical layer will cause a symbol error (recall that a symbol is a 5-bit quantity), resulting in four consecutive bit errors. 
To facilitate correction of such 4-bit burst errors the erroneous bits must be distributed amongst four different 32-bit 
data blocks. 

[0114] Consider a stream of 352 32-bit data blocks BO, B1, B2, ...B351 emerging from the parity generator for trans- 
mission. Recall that the 24 DSD channel frame of Figure 18B comprises an audio data payload of 352 32-bit data 

4 5 blocks. The resulting stream of nibbles from the interleaver is comprised as shown in Figure 24. In this Figure the bits 
of the audio payload are labelled such that B2[0] refers to bit 0 of block 2, for example. Thus it can be seen that nibble 
zero comprises bit 0 of blocks 0, 1 , 2 and 3 respectively; nibble 1 comprises bit 0 of blocks 4, 5, 6 and 7 respectively 
and so on. Accordingly, nibbles 0 to 7 collectively comprise bit 0 of each of the thirty-two 32-bit data blocks, nibbles 8 
to 15 collectively comprise bit 1 of each of the thirty-two 32-bit data blocks and nibbles 2802 to 2815 comprise bit 31 

50 of each of the thirty-two 32-bit data blocks. The 32-block interleaving system used by MAC- DSD facilitates the correction 
of up to eight symbol errors (i.e. 32 bits can be corrected overall) in a group of 32 interleaved data blocks (256 nibbles 
or symbols). 

[0115] In summary, the version of the MAC-DSD protocol used for transmission of 24 DSD channels as described 
above with reference to Figures 18B and 20 to 23 has key features including: 24-channel, full-duplex transfer of 
55 2.8224MHz DSD audio; 1 0OBase-TX physical layer; audio latency of less than 50 microseconds; Hamming linear block 
code error correction, with 256-nibble interleaving, to correct up to 8 nibble errors per 256-nibble block group; 64fs 
DSD clock transfer in both directions; and frame flag indication for transfer of the 44.1 kHz sync signal. 
[0116] Figure 25 schematically illustrates the protocol layers of the MAC-DSD protocol for the particular example 
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embodiment using the 24 DSD channel frame format. On the transmitter side 1000 the protocol layers comprise a 
parity generating and formatting layer 1010 that receives the incoming 24 channel DSD audio stream and an auxiliary 
data stream of up to 5.6Mbit/s. This layer 1 01 0 generates six parity bits for each 24 audio bit and 2 auxiliary bit sample 
and formats the resulting 32-bit data block. The 32-bit data blocks output by the parity generating and formatting layer 
1010 are supplied to an interleaving layer 1020 that interleaves the data blocks in groups of 32 and outputs the inter- 
leaved data across the Mil 218 in 4-bit nibbles as specified in Figure 24. The nibbles of data from the interleaver are 
supplied to the FIFO buffer 810 of the transmitter at a continuous data rate of 90.31 68Mbit/s. The nibbles continue to 
fill the FIFO buffer 81 0 until the predetermined threshold buffer occupation level is reached (as described with reference 
to Figure 14) whereupon assembly of a data frame begins. During data frame assembly data nibbles are read out of 
the FIFO buffer 81 0 and passed to a frame assembly layer 1 040. The frame assembly process involves use of a header 
data generation module 1050 that generates frame header information and a CRC generation module 1060 that gen- 
erates data for the CRC field, which is word 367 of the frame format of Figure 18B. The frames are assembled such 
that they contain a 1408 byte payload of 352 DSD samples contained in 352 32-bit data blocks. Data from the frame 
assembly layer 1040 is output as Mil frames (which comprise nibbles) at a rate of 101 .6064Mbit/sec and supplied to 
the transmitter physical layer 1 070 which prepares the data for transmission across the physical medium. The trans- 
mitter physical layer 1 070 forms a 5-bit symbol from each 4-bit nibble and the symbols are transmitted to the receiver 
across a twisted-pair cable. On the receiver side 1100 a receiver physical layer 1110 receives the 5-bit symbols and 
processes them to form Mil frames comprising 4-bit nibbles. The Mil frames are supplied to a frame disassembling 
layer 1 1 20 at a rate of 1 01 .6064 Mbit/sec, which performs the CRC checks and strips off the header data for subsequent 
processing. The frame payload is output by the frame disassembling layer 1120 as Mil nibbles which are fed to the 
FIFO buffer 870 (as described above with reference to Figure 15) which has a low latency with regard to data output. 
Data is output from the FIFO buffer 870 in the form of Mil nibbles and passed to a deinterleaving layer 1160. The de- 
interleaver de-interleaves the data in groups of 32 data blocks to reconstruct individual 32 -bit data blocks of the format 
illustrated in Figure 22. The 32-bit data blocks are then passed to a parity decoding and data extraction layer 1170 
whereupon the parity data is used to perform error control and the recovered payload data is extracted. The output of 
this layer is a 24 channel DSD audio stream and an auxiliary data stream of up to 5.6Mbit.s Note that in Figure 25, 
although the FIFO buffers 810, 870 do not perform any data translation and therefore are not technically protocol layers, 
they are included in the schematic illustration of the protocol layer structure for completeness. 
[0117] Note that in the case of the 352 sample payload of the 24 DSD channel frame format of Figure 18B, the 
transmission buffer size and predetermined buffer occupancy threshold differs from the buffer size and occupancy 
threshold specified in the description of Figure 14 above for the 370 sample payload of the 32 DSD channel Frame 
Format of Figure 18A. In particular, for the 24 DSD channel frame format the minimum buffer size is 36 data blocks 
(rather than 42 data blocks) and the corresponding minimum occupancy threshold value is 30 data blocks (as before). 
The audio latency introduced by this buffering is equivalent to 36 DSD samples (rather than 42 samples) or 14.9 
microseconds (rather than 12.2 microseconds). 

[01 18] The above described system in which the physical layer of a link such as an Ethernet link is used to provide 
a data communication system for transmission of DSD data may also be used to transmitted other types of clocked 
digital data. In particular the system may be adapted to accommodate transmission of Pulse Code Modulated (PCM) 
digital data. The physical layer connection according to the present technique offers high bandwidth for communication 
of PCM data. 

[0119] PCM data is clocked at a much lower frequency (e.g. 44.1kHz-96kHz) than the 64Fs clocking frequency of 
DSD data. Accordingly, in order to support PCM transmission as well as DSD transmission a further lower frequency 
clock signal, which shall be referred to as the word clock signal, is communicated between networked devices along 
the twisted pair cable. The word clock is used for reproduction of PCM data at the receiver. The frame format for 24 
DSD channels as illustrated in Figure 18B allows for transmission of 352 bits of data per frame for each of 24 audio 
channels. Essentially, 352 24bit DSD samples (one bit per channel) are transmitted in a single frame. Data blocks are 
transferred over the link at an aggregate rate of 64 Fs, that is , 2. 8224MHz (=64*44.1 KHz) for 44.1 kHz based audio 
and 3.072MHz for 48kHz based audio. In orderto transfer PCM data in the desired frequency range of (44.1 kHz-1 2.5%) 
to (96kHz +12.5%) at the required data rates it is necessary to accommodate between 4 and 13 24-bit samples per 
frame. Accordingly, a number of alternative data formats are defined so that at the transmitter it is decided how many 
samples should be sent in the next frame and a look-up-table is consulted to select the appropriate one of the alternative 
data formats. Known PCM transmission formats such as I2S and AES3-1 992 and package PCM sample data in serial 
sub-frames. AES3 is an Audio Engineering Society standard for the serial transmission format for linearly represented 
digital audio data over conventional shielded twisted-pair conductors, of up to at least 1 00 m in length, without equal- 
isation. Figure 26A schematically illustrates the AES3 sub-frame format. Each AES3 frame is uniquely composed of 
two sub-frames and typically the rate of frame transmission corresponds exactly to the source sampling frequency. 
The first sub-frame starts with the preamble X but the preamble changes to Z once every 1 92 frames. This defines the 
block structure used to organise the channel status information. The second sub-frame always starts with preamble 
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Y. As illustrated in Figure 26A each AES sub-frame is 32-bits long in which :bits 0 to 3 contain a preamble ; bits 4 (least 
significant bit) to 27 (most significant bit) contain a 24-bit audio sample word; bit 28 is a "V" field which carries a validity 
bit associated with the audio sample word; bit 29 is a "IP field which contains one bit of a user data channel associated 
with the audio data channel transmitted in the same subframe; bit 30 is a "C" field or channel status field which carries 
one bit of channel status information associated with audio data transmitted in the same subframe; and bit 31 is a "P" 
field which carries a parity bit such that time slots 4 to 31 inclusive will carry and even number of ones and an even 
number of zeros i.e. even parity. The V bit is logic 0 if the audio sample word is suitable for conversion to an analogue 
audio signal and is logic 1 otherwise. The C bit is one-bit of channel status information specifying for example the 
length of audio sample words, number of audio channels, sampling frequency etc. Channel status information is or- 
ganised in 1 92-bit blocks sub-divided into 24 bytes. The first bit of each block is carried in the frame having preamble Z. 
[0120] Figure 26B schematically illustrates the sub-frame format for PCM transmission according to the present 
technique. This 27-bit sub-frame structure includes the U bit and C bit fields of the known AES3 sub-frame format to 
facilitate transparent transfer of AES3 format data across the physical layer link. As illustrated in Figure 26B, bits 0 to 
23 contain data, bit 24 contains the U bit, bit 25 contains the C bit and bit 26 contains an M bit. The U and C bits are 
taken directly from incoming AES3 data streams or from the user data and channel status buffer memory in the trans- 
mitter. The M-bit is a multiplexed bit specific to the present technique and may be used to contain any of the three 
following indications at certain points in the bitstream: an S bit which is a flag that identifies an external Fs/n (n integer) 
synchronised data sample and is repeated across all data channels every n periods of Fs; the Z bit that signals the 
start of the AES3 U/C data block which repeats on each channel every 192 samples; and the V bit which is the sub- 
sampled AES3 V-bit status. The S and Z indications are each used to identify particular samples within the audio data 
stream. Since the S and Z indications are by their nature periodic it should in principle be sufficient to simply specify 
their phase with respect to the sample dock. However, in practice the S and Z indications should be repeated moderately 
frequently to enable the link to lock rapidly on start-up and to detect any link failures in a timely manner. The M-bit 
marks an S indication with two consecutive logical 1's in the bitstream as shown in Figure 27B whereas a Z indication 
is marked by a single logical '1 ' as shown in Figure 27A. In order to indicate the sync sample (S-bit) by two consecutive 
logical '1' s in the M bit data stream a counter is provided at the transmitter to pre-empt the occurrence of the sync 
signal. The V-bit status is indicated for each channel in the bit immediately following the S indication. This implies that 
V is indicated less frequently than per-sample, but is indicated per-channel at intervals of the S sync (i.e. typically Fs/ 
2048, or about 46ms at 44.1kHz), and also assumes that V-bit status does not change rapidly, which is a reasonable 
assumption for the vast majority of audio applications. Figure 27C shows a V-bit indication which is a logical 1 (true), 
thereby signalling that channel samples are valid resulting in three consecutive logical 1 's in the bit stream (two 1 *s for 
the S-bit and 1 for the V-bit). Figure 27D shows a V-bit indication of 0 (false) immediately following the two logical 1 *s 
of the S-bit. This signals that the channel samples are invalid. Since the M-bit is used to indicate a number of alternative 
events it is likely that event indications will either coincide or be close enough in succession to interfere. For this reason 
priority is always given to S indications over Z indications. As a consequence of this Z indications will occasionally be 
missed so it is appropriate to maintain counts of the U/C block phases in the receiver in order to set the Z-bits in 
outgoing AES3 streams in these circumstances. Figures 28A to 28E give examples of relative occurrences of S indi- 
cations and Z indications and indicate whether or not the relative positioning requires that the Z indication be disabled. 
In Figure 28A the Z indication coincides with the second bit of the S indication so Z is disabled and only S is indicated. 
In Figure 28B the Z indication immediately precedes the S indication in the received M-bit sequence, in which case Z 
is disabled because S is imminent. Otherwise the three consecutive logical 1 's would be indistinguishable from the S 
and V indication of Figure 27C. In Figure 28C the Z indication precedes the S indication but is separated from it by a 
single bit period. Since Z and S are sufficiently separated so that they do not interfere so both the Z indication and the 
S indication are enabled here, in Figure 28D the Z indication immediately follows the S indication and could lead to 
ambiguity so the Z indication is disabled. In Figure 28D, the Z indication follows the S indication with a single bit-period 
separation. As for Figure 28C, Z and S are sufficiently separated so that they do not interfere so both the Z indication 
and the S indication are enabled here. 

[0121] For the purposes of transferring PCM data between devices on the physical layer the frame format is basically 
the same as the format described above in relation to Figure 1 8B. In particular, each frame is 1 472 bytes long and the 
data payload consists of 352 32-bit data blocks. Each 32-bit block comprises 24 audio data bits and two auxiliary data 
bits, which together form 26 independent bit-stream segments of 352 bits per frame. In PCM mode, each of the 24 
audio bitstreams is divided into a number of sample subframes which are separated by padding. The number of sub- 
frames varies from 4 to 13 in dependence upon the particular PCM sample frequency. This enables support for samples 
rates from 44.1 kHz - 12.5% to 96kHz + 12.5%. Each sample sub-frame contains data from a single PCM sample. 
[0122] For each possible number of sample subframes per bitstream segment, a specific arrangement of sample 
subframes and padding bits is defined. All padding bits should have the value 0. This determinism enables the receiver 
to correctly extract the sample subframes from the bitstream segment. These arrangements are shown in Table 9A. 
Table 9B gives a specific example of the subframe arrangement for the case of 9 sample subframes per frame. 
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Table 9A 



Number of samole 

1 v w> III mm ■ ^m m 111 aw I %v 

subframes 


Paddina bits after each 
subframes 


Final oaddina bits at end of bitstream 
segment 


9 


12 


1 


10 


8 


2 


11 


5 


0 


12 


2 


4 


13 


0 


1 



15 



Table 9B 
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25 



30 



35 



40 



45 



Element 


Bits 


sample subframe 1 of 9, bit 0 first 


27 


padding 


12 


sample subframe 2 of 9, bit 0 first 


27 


padding 


12 


sample subframe 3 of 9, bit 0 first 


27 


padding. _ . - 


12 


sample subframe 4 of 9, bit 0 first 


27 


padding 


12 


sample subframe 5 of 9, bit 0 first 


27 


padding 


12 


sample subframe 6 of 9, bit 0 first 


27 


padding 


12 


sample subframe 7 of 9, bit 0 first 


27 


padding 


12 


sample subframe 8 of 9, bit 0 first 


27 


padding 


12 


sample subframe 9 of 9, bit 0 first 


27 


padding 


12 


final padding 


1 


Total 


352 



[0123] Accordingly, the data block audio bit usage for the frame format of Figure 1 8B in PCM mode differs from the 
audio bit usage in DSD mode. A further difference in the frame format in PCM mode relative to DSD mode relates to 
the Frame Format ID values contained in the three identical frame ID sections in words 13 and 14 of Figure 18B. The 

so frame format ID fields of each section were outlined above with reference to Figure 20. In summary, each frame format 
ID section comprises a flags field, a frame type field, a protocol major version field and a protocol minor version field. 
To accommodate PCM mode, the frame type field values are extended relative to those defined in Table 6 above. As 
specified in the table of Figure 29, 10 new frame type values have been defined corresponding to the 10 different 
possibilities (integers in the range 4 to 13) for the number of sample subframes per frame. Two separate formats for 

55 the frame flags field of the frame format ID (see words 13 and 14 of Figure 18B and Figure 20) have been defined : 
one format for DSD frames and another format for PCM frames. The table of Figure 30 shows the flags field format 
for a DSD frame. In this case flag bit 0 indicates whether or not the first DSD sample in the frame was received simul- 
taneously with the 44.1 kHz sync clock positive edge whereas flag bit 1 indicates whether or not the first DSD sample 
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in the frame was received simultaneously with the Fs/n sync clock positive edge. The tables of Figure 31 show the 
flags field format for a PCM frame. In this case flag bits 0:1 specify the frequency of the audio base clock whereas flag 
bits 3:2 specify the base clock sample rate multiplier. The sample rate can be specified to be 1 , 2, 4 or 8 times the 
base clock frequency Fs. 

[0124] The PMC frame format described above relate to an example embodiment in which 24 audio channels are 
accommodated. An alternative embodiment may involve accommodating 48 audio channels in 1 Fs PCM mode (sample 
rate 44. 1 kHz or 48 kHz ± 1 2.5%). In this case two audio channels are multiplexed onto each bitstream. The multiplexing 
may be implemented per sub-frame or per bit. 

[0125] The clock and synchronisation functions of the PCM mode will now be considered in detail. As mentioned 
above, for transmission of PCM data across the network a word clock is required in addition to the 64Fs MAC-DSD 
cable clock. Rather than sending two separate clock signals across the twisted pair cable, the 64Fs clock and the word 
clock are multiplexed. The multiplexing process involves modulating the word clock signal onto the 64Fs clock signal 
by shifting at least one edge of the 64Fs clock pulse i.e. by generating a "clock pulse width deviation". The clock pulse 
width deviation acts as a phase indicator signal for the word clock, which is embedded in the 64Fs clock. The clock 
pulse width deviation is formed by identifying certain transitions in the 64Fs clock signal which are determined to be 
coincident with the word clock transitions at the transmitter. Since in this embodiment the positive-going transitions of 
the 64Fs clock are used for timing synchronisation, the phase of the word clock is encoded by shifting the positions of 
negative-going 64Fs clock transitions. In particular, where a word clock and 64Fs clock transitions positive-going tran- 
sition coincide, the preceding negative-going transition of the 64Fs clock is shifted to produce a multiplexed clock 
signal. Figure 32 schematically illustrates how the multiplexed clock signal is formed in dependence upon the 64Fs 
signal and the word clock signal. In Figure 32 the uppermost signal 3210 is the unmodified 64fs clock signal used to 
synchronise the PLL in the receiver, the middle signal 3220 is the word clock signal used to synchronise PCM data 
framing in the receiver and the lowermost'signal 3230 is the multiplexed clock signal in which the negative-going tran- 
sitions have been shifted. The multiplexed clock signal 3230 is the clock signal that is transferred over the MAC-DSD 
link. In Figure 32 time is increasing to the right along the horizontal axis. It can be seen that the positive going edge 
3212 of the 64Fs clock signal coincides with the positive-going edge 3222 of the word clock signal. Accordingly, the 
preceding negative-going edge 3214 of the 64fs clock signal has been shifted back in time by time t cfkmod thereby 
reducing the width of that clock pulse (see edge 3234 of in the multiplexed clock signal 3230) whilst the subsequent 
negative-going transition 3216 of the 64fs clock edge has been shifted forwards in time by a corresponding time in- 
crement t dkmod (see edge 3236 of the multiplexed clock signal 3230) thereby increasing the width of the pulse. The 
negative transition 3236 after the word clock edge 3222 is delayed by the same amount that the preceding negative 
edge 3234 is advanced. The delay of the subsequent negative-going transition 3236 is performed by way of compen- 
sation to avoid DC content in the signal. DC content in the signal is likely to cause periodic "baseline shift" at the word 
clock frequency, when the signal is transmitted in an AC-coupled system. Performing this compensation on the clock 
cycle following the coincidence of the word clock and 64Fs clock also reduces the Fs/n clock frequency content in the 
64fs signal. This is important, since it is desirable to reduce low- frequency jitter in the received 64fs clock, which is 
typically used to feed a PLL circuit to generate an ADC/DAC audio sample clock. The edge offset time (td^^) shown 
in Figure 32 is exaggerated for ease of illustration. The true time shift will typically be very small, for example, one 
2048fs period (11.07ns, at Fs = 44.1kHz). Note that the shift or "pulse width deviation" introduced to the clock mux 
signal shown in Figure 32 does not occur every word clock cycle. Rather the clock pulse width deviation only occurs 
once every n clk_fs cycles, where n is an integer value controlled by a register. Effectively, introduction of the clock 
pulse width deviation every n word clock cycles amounts to multiplexing a clock signal of frequency Fs/n with the 64fs 
clock. Since the frequency of the sample clock (word clock) is known, all that needs to be communicated by the trans- 
mitter is phase information which enables the receiver to reconstitute the word clock signal with a six-bit counter. The 
counter is reset by the Fs/n signal and incremented by the 64fs clock. Note that the signal forms of Figure 32 apply to 
both the transmitter (which generates the multiplexed clock) and receiver (which generates the Fs clock) ends of the 
connection. 

[0126] Figure 34 schematically illustrates a MAC DSD transmitter 3400 (the counterpart of the FPGA 512 in Figure 
6) adapted for transmission of both PCM and DSD data. The MAC DSD transmitter module comprises: a 64Fs clock 
generator 3410; an Fs sync generator (word clock generator) 3420; a clock multiplexer module 3430, a counter 3440; 
an S-bit generator 3450; an encoding and block construction module 3460; an interleaver 3470; a FIFO buffer 3490 
and a frame assembler 3492. 

[01 27] The clock multiplexer 3430 generates the pulse width deviated clock signal (illustrated in Figure 32) by shifting 
certain negative-going edges of the 64Fs clock signal in dependence upon output from the word clock sync generator 
3420. The pulse width deviated clock signal is transmitted across the twisted pair cable to the receiver. The counter 
3440, keeps track of the 64fs clock signal in order to pre-empt the occurrence of the Fs sync signal. It is necessary to 
pre-empt the Fs sync signal to facilitate generation of the S-bit in the audio data stream, which is performed by the S- 
bit generator module 3450. Note that the PCM samples are individually labelled with sync markers via the M-bit en- 
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coding (see 27-bit PCM audio sample structure of Figure 27) whereas DSD mode frames rely on a frame flag bit being 
set in the transmitter and the marker bit of the first sample of the flagged frame being set on entry to the receiver FIFO. 
The output of the S-bit generation module 3450 is supplied to the encoding and block construction module where parity 
bits are generated and padding bits are inserted for PCM mode frames only to construct the 32-bit data blocks of the 
5 frame payload (see Figure 18B). Data blocks from the encoding and block construction module 3460 are supplied to 
the interleaver 3470 which outputs 4-bit nibbles of interleaved data to the FIFO buffer 3490. The transmitter FIFO 3490 
bridges the audio clock and link clock (PHY 514 in Figure 6) domains of the transmitter. The transmitter FIFO buffer 
3490 is 25 bits wide. Of the 25 bits, 24 bits are associated with 24 respective channels of concurrent DSD or PCM 
audio samples, the 25th bit being reserved as a synchronisation marker. The 25 th bit indicates whetherthe correspond- 

10 ing DSD or PCM audio sample occurred simultaneously with an Fs/n clock edge in the transmitter. This is illustrated 
in Figure 33 which shows five consecutive DSD samples (n-2), (n-2) , n, (n+1 ), (n+2) and their timing relationship with 
the local 64Fs clock and the word clock. It can be seen that sample n corresponds in time to the coincidence of the 
positive going edge of the word clock and 64Fs clock. Accordingly the positive-going edge of the marker bit coincides 
with the beginning of DSD sample n. Data is read out from the transmitter FIFO 3490 in dependence upon the occupancy 

15 threshold (as described above with reference to Figure 1 4) and supplied to the frame assembler 3492. Data from the 
frame assembler 3492 is supplied to the PHY of the transmitter. The transmitter start-up procedure differs slightly for 
PCM mode and DSD mode operations. In PCM mode on start-up, the transmitter starts transmitting as soon as possible. 
Marked samples are explicitly indicated via the PCM sample subf rame 'M-bif encoding. However in DSD mode marked 
samples are not explicitly indicated but are derived from flag bit 1 of the frame flags as specified in the table of Figure 

20 30. Accordingly, on start-up in DSD mode, the transmitter holds-off transmitting the first frame until one of the marked 
samples (i.e. sample synchronous with Fs/n clock) is available in the FIFO. While the transmitter is in this hold-off 
state, samples are read-out of the PHY clock side of the FIFO and dropped. When a marked sample becomes available 
(as indicated by flag bit 1), the interleaving, encoding and frame formatting mechanisms are enabled, such that the 
first sample in the first frame is the marked sample. From this point, frame transmission is governed by the buffer status 

25 (to initiate frame assembly) and frame format rules. 

[0128] Figure 35 schematically illustrates a MAC DSD receiver 3500 (the counterpart of the FPGA 526 in Figure 7) 
adapted for reception of both PCM and DSD data. The MAC-DSD receiver 3500 comprises: an Fs/n sync detection 
module 351 0; an Fs clock generation module 3520; a monostable counter 3530; a frame receiving and decoding module 
3540; a FIFO buffer 3550; a deinterleaver 3560; and a decode/block deconstruction module 3570. The Fs/n sync 

30 detection module receives the pulse width deviated clock signal from the twisted pair cable and determines the relative 
phases of the 64fs clock and the word clock on the basis of this signal. The Fs/n phase information is supplied as input 
to the word clock generation module 3520, which outputs the word clock (Fs) signal. 

[0129] The incoming cable clock signal is passed directly to the local phase locked loop of the receiver system in 
order to synchronise the system. It is not possible to use the extracted Fs clock derived from the word clock generation 

35 module 3520 for this purpose. This is because the word clock generation module 3520 requires sequential logic that 
is clocked form the local PLL so that the extracted signal is always synchronous wit the local PLL. This means that the 
output of the word clock generation module 3520 is unsuitable as a synchronisation source for the PLL. 
[0130] Note that the Fs clock signal in the receiver is of the same phase as the Fs clock signal in the transmitter as 
a result of the Fs/n sync. The Fs/n phase information is also supplied to the monostable counter. The monostable 

40 counter is triggered by reception of each Fs/n indication to count 64fs clock periods. The FIFO output is disabled on 
detection of the first marked sample in the FIFO 3550, whereupon the FIFO begins to fill with data. After a number of 
64fs cycles equal to the predetermined link latency, the Fl FO 3550 outputs are enabled. The predetermined link latency 
incorporates the delay incurred in the transmitter due to data encoding and frame assembly plus the delay incurred at 
the receiver due to the decoding process. The predetermined latency of the data link is programmed to be an exact 

45 multiple of 64fs clock periods measured with respect to the Fs/n sync signal transmitted on the cable clock. 

[0131] Mil frames (comprising nibbles) from the PHY 526 of the receiver (see Figure 7) are supplied as input to the 
frame reception and decoding module, where header data is removed, and error checks are performed. The decoded 
data is supplied as input to the FIFO 3550 in the form of Mil nibbles. The FIFO outputs 4-bit data nibbles, which are 
supplied to the deinterleaver 3560 for deinterleaving. The deinterleaved data is then fed to the decode/block decon- 

50 struction module 3570 where the audio data payload data is extracted and output as an audio data stream. 

[0132] Figure 36 schematically illustrates a system in which two sample-synchronous links are operated in parallel 
and in which the Fs/n sync signal is used to synchronise the parallel links. The system comprises a transmitting device 
3600 which is connected by a first cables 3603 and a second cable 3605 to a receiving device. The transmitting device 
3600 has a first MAC-DSD transmitter 3610 which is connected to a first MAC-DSD receiver 3710 in the receiving 

55 device 3700 via the first cable 3603. The transmitting device 3600 also has a second MAC-DSD transmitter 3620 which 
is connected to a second MAC-DSD receiver 3720 in the receiving device 3700 via the second cable 3603. The two 
MAC-DSD transmitters 3620, 3620 are driven by an internal clock source 3630 that supplies them with both a 64Fs 
clock and a word clock. In the receiving device 3700 only the first MAC-DSD receiver 3710 acts as a clock source 
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thereby serving as a master clock. This receiver 3710 derives the word clock signal and the 64Fs clock signal from 
the multiplexed clock signal received via the first cable 3603. Note that if a separate word clock source were used then 
neither of the MAC-DSD receivers 371 0, 3720 would serve as a master clock source. The 64Fs and word clocks 
extracted from the link cable 3603 are supplied to a PLL 3730 that outputs a word clock signal and a 64Fs clock signal 
5 to both the first MAC-DSD receiver 371 0 and the second MAC-DSD receiver 3720. The second MAC-DSD receiver 
3720, which is not serving as the master clock source, should re-clock the multiplexed clock signal received via the 
second cable 3605 in order to detect the Fs/n indicator (i.e. the clock pulse width deviation). The propagation delay 
on the link via the first cable 3603, , is likely to be different from the propagation delay on the link via the second cable 
3605. The difference in propagation delay between the first link 3603 and the second link 3605 is determined by com- 
10 paring the position of the received 64fs clock edges with the locally-regenerated 64fs clock (from PLL 3730), and by 
comparing the position of the received Fs/n indicator with the locally-regenerated Fs word clock, (also from PLL 3730). 
Figure 37 schematically illustrates a measured difference in propagation delay between the two links. It can be seen 
from Figure 37 that the positive-going clock edge 3812 immediately following the shifted negative-going clock edge 
(pulse width deviated pulse) in the clock multiplexed signal 381 0 is shifted relative to the corresponding positive-going 
'5 clock edge of the locally regenerated 64Fs clock signal 3822 and relative to the positive edge 3832 of the locally 
regenerated word clock signal 3830 by an amount In particular, the received cable clock Fs/n indicator occurs 
later in time than the local Fs clock edge. Given that the local Fs clock edge is derived to be synchronous with the 
received cable clock Fs/n indicator on the clock master MAC-DSD link, this indicates that the cable propagation delay 
for the second link 3605 is longer than the cable propagation delay for the clock master link 3603. The relative difference 
20 in propagation delay between the clock master link 3603 and the other link 3605 is The time t offS€t is defined to 
be negative in the case that the non-master link 3605 is delayed relative to the clock master link 3603 as shown above, 
and positive in the case that the non-master link 3605 is advanced relative to the clock master link. 
[0133] Once t^^ is determined at the receiver, the following algorithm must be followed to adapt the latency mon- 
ostable counter 3530 of the receiver to ensure synchronous operation with the clock master link. If t^^ is positive (i. 
25 e. non-master link 3605 is advanced in time relative to clock master 3603 link) then when the Fs/n indicator is detected 
via link 3605 the latency monostable counter in MAC_DSD receiver 3720 is not started until the next word clock edge. 
However, if t offset is negative (i.e. non-master link 3605 is delayed relative to master link 3603 as in Figure 37) t offset is 
rounded down to an integer number of 64fs periods and one is subtracted from this value to derive a value for the 
timeout for the non-master latency monostable counter. The latency monostable counter in MAC-DSD 3720 (non- 
30 master) is started at the first 64Fs clock edge following the timeout. This will result in the non-master latency monostable 
counter timing out synchronously with the monostable counter in the clock master receiver. 

[0134] If the predetermined link latency period expires before a marked sample is detected in the FIFO 3550 this is 
an indication that either there is a fault in the system or that the predetermined link latency has been set at too small 
a value for the link conditions. Accordingly, if the latency period expires before the marked sample is detected an 
35 interrupt signal is raised and error indicator bits are set. Table 1 0 below specifies for each of seven audio data formats 
an example link latency in 64Fs periods and in microseconds. 



Table 10 



Audio format 


Latency (64fs periods) 


Latency (^s) 


DSD 


127 


44.9 


44.1kHz PCM 


1 92 (3 samples) 


68 


48kHz PCM 


1 92 (3 samples) 


62.5 


88.2kHz PCM 


1 60 (5 samples) 


56.6 


96kHz PCM 


1 60 (5 samples) 


52.1 


176.4kHz PCM 


1 44 (9 samples) 


51.0 


192kHz PCM 


144 (9 samples) 


46.9 



[01 35] A further development of the arrangements of Figures 2 to 37 will now be described. The substantive changes 
incorporated in this further development are as follows: 

i. Auto-crossover support is implemented to enable automatic configuration of the devices having Media Dependent 
Interfaces when they are connected to the LAN. 

ii. Synchronisation mechanisms have been unified so that the S-bit is not required for PCM mode - both DSD mode 
and PCM mode use the synchronisation mechanism according to which the first sample of the first frame is aligned 
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with the fs/2048 synchronisation indicator. 

Hi. Frame format ID structure is modified from that described in relation to Figures 43 and 44 A to F. 

[0136] Table 1 above specifies the settings of socket terminal connections of the RJ45 connector plug. In the single 
cable arrangement of Figure 3 a special "crossover" category 5 cable was required to reverse the input/output con- 
nections to facilitate an effective point to point link between the two audio devices. The crossover cable is required 
because of the fact that the assignment of different connector pins to the wires in the category 5 twisted pair cable at 
the RJ-45 is predetermined. Clearly, if the transmit pair of wires of one device is connected to the transmit connectors 
of the other audio device then the communication link will fail. As a result of the predetermined RJ-45 connector pin 
assignments, it may well be that the two audio devices use the same pin designations on their interfaces, in which 
case, a crossover cable is required. By way of contrast, standard category 5 cables could be used for the star config- 
uration arrangement of Figure 4. The arrangement according to the further development of the present technique 
makes use of auto -crossover technology for automatically ensuring viable connections between networked devices 
without the need for crossover cables. 

[0137] The RJ45 connectors and category 5 cables are associated with the MDI 211 of Figure 1. Table 11 below 
specifies the settings of the RJ45 socket connections for the arrangement that implements auto-crossover (compare 
with Table 1 ). There are two different pin allocation settings corresponding to two distinct modes: MDI signal allocation 
mode and MDI-X crossover signal allocation mode. According to this arrangement all RJ45 connectors feature the 
same contact assignments and switching between MDI and MDI-X modes is automatically negotiated so that crossover 
cables are not required. The crossover negotiation between MDI and MDI-X modes is independently performed for the 
data and for the synchronisation signal. The synchronisation signal electrical interface comprises two independent 
half-duplex (i.e. two-way alternate) signal paths, the two paths transmitting data in opposite directions (i.e. one output 
and one input). Automatic negotiation is used to control which of the signal paths is the transmitting path. Each signal 
path is a differential pair and differential signalling is used. Each physical layer transceiver device is capable of auto- 
matically switching between MDI and MDI-X modes. 



Table 11 



Contact 


MDI signal allocation 


MDI-X (crossover) signal allocation 


1 


Audio data transmit + 


Audio data receive + 


2 


Audio data transmit - 


Audio data receive - 


3 


Audio data receive + 


Audio data transmit + 


4 


Sync signal transmit + 


Sync signal receive + 


5 


Sync signal transmit - 


Sync signal receive - 


6 


Audio data receive - 


Audio data receive - 


7 


Sync signal receive + 


Sync signal transmit + 


8 


Sync signal receive - 


Sync signal transmit - 



[0138] The transceiver device controls the input/output direction of the two independent half-duplex synchronisation 
signal paths. Accordingly two different networked nodes (audio devices) on a link can automatically negotiate which 
synchronisation signal transceivers switch to the crossover contact allocations of Table 11 above. 

45 [01 39] Figure 38 schematically illustrates a state diagram representing how switching is controlled to achieve a cor- 
rect polarisation for the crossover circuit. In the state diagram state 381 0 is MDI mode, in which the synchronisation 
signal connector contact allocations are as indicated in the left-hand column of Table 11 . State 3820 is the crossover 
MDI-X mode, in which the synchronisation signal connector contact allocations are as indicated in the right-hand column 
of Table 1 1 . When a reset is performed the audio device the state variable sync_det is FALSE, indicating that no valid 

so synchronisation signal is currently being detected on the synchronisation signal paths. An asynchronous timer asso- 
ciated with the state variable a_timer in Figure 38 ensures an arbitrary initialisation of the state machine to one of the 
two possible initial states: MDI mode or MDI-X mode. The period of this asynchronous timer in the present arrangement 
is 100 milliseconds +/- 25%. The initialisation is implemented such that the probability of two different nodes taking 
pseudo-random bit sequence (PRBS) seed values at state machine initialisation is very low, 

55 [0140] Consider, for example the case where the node device is initialised to the MDI state. The node waits in MDI 
mode 3810 for a predetermined time interval represented by the sample_timer state variable. In the present arrange- 
ment the sample timer has a period of 1 millisecond +/- 25%. During this sample time interval the node evaluates its 
receive channel to determine whether or not it is in receipt of a valid synchronisation signal from the other end of the 
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link. If a valid synchronisation signal is in fact detected then the state variable sync_det will have the value TRUE and 
the node will remain in the current configuration (MDI mode in this case) as indicated by path 3813 in Figure 38. 
However, if sync_det is FALSE, indicating that no valid synchronisation signal has been detected on the receive chan- 
nel, the next state is determined by the value of a pseudo-random number 

[01 41 ] Figure 39 schematically illustrates a Linear Feedback Shift Register (LFSR) used to create a pseudo-random 
sequence. The LFSR comprises 11 stages S[0] to S[11], the output from stages S[8] and S[10] being supplied to an 
adder 391 0 whose output is fed back as an input to the first stage S[0] . The output of S[1 0] provides a single-bit pseudo- 
random sequence for use in the state machine algorithm of Figure 38. The LFSR is controlled by the sample timer so 
that a value RND(sampleJimer) is generated periodically. 

[0142] Referring back to Figure 38, if sync_det is FALSE and RND(sample_timer) = 1 then a state transition is made 
from MDI mode to MDI-X mode (as indicated by path 3815 ) whereupon the signal allocations of each of the eight 
contact pins are assigned according to the right-hand column of Table 1 1 . However if RND(sampleJimer) = 0 the node 
will remain in the present state as indicated by path 3813. Considering the case where a transition has been made 
from the MDI mode 3810 to the MDI-X mode 3820 following a failure to detect a valid synchronisation signal in MDI 
mode and a pseudo-random number generation event of RND(sample_timer) = 0. In this case the node device again 
waits for a time corresponding to the sample timer period and evaluates its receive channel to determine whether a 
valid synchronisation signal is being received from the other end of the link. If sync_det = TRUE indicating that a valid 
synchronisation signal has in fact been received then the node device remains in the MDI-X mode, as indicated by 
path 3819. However, if sync_det = FALSE then the LFSR is incremented and a decision on whether to switch states 
is dependent upon the pseudo-random output of the LFSR. In particular, if RND(sampleJimer) = 0 a transition is made 
from MDI-X mode to MDI mode as indicated by path 381 7, whereas if RND(sampleJimer) = 1 the node device remains 
in MDI-X mode despite no valid synchronisation signal having been detected in the previous sample timer period. 
[0143] Figure 40 schematically illustrates an arrangement for audio clock transmission according to a development 
of the present technique. The transmission arrangement comprises a clock signal path having clock multiplexer 401 0, 
a Low Voltage Differential Signalling (LVDS) transmitter 4020 and a clock transformer 4030 connected in series to an 
RJ45 connector 4040. Thus the clock signal of this arrangement is a transformer coupled M-LVDS (Multipoint Low 
Voltage Differential Signalling) signal. The data signal path comprises a High Resolution Multi-channel Audio Intercon- 
nection (HRMAI) logic 4050 operable to input/output audio data, connected in series to a physical layer device (PHY) 
4060 which is in turn connected to a data transformer 4070 and finally to the RJ45 connector 4040. The arrangement 
of Figure 40 differs from that of Figure 6 in that the differential line driver 554 has been replaced by the LVDS transmitter 
4020 and the clock multiplexer 4010 is shown. However the clock multiplexer 4010 has the same function as clock 
mux 3430 as shown in Figure 34. In particular, the clock multiplexer 401 0 is operable to multiplex the 64fs clock and 
the fs word clock so that the clock signal contains an indication of the fs word clock phase via periodic modulation of 
one edge of the clock signal as described above with reference to Figure 32. Note that the S-blt generation described 
in relation to Figure 34 is not required for the arrangement of Figure 40. The LVDS transmitter 4020 is required in this 
arrangement to support auto-crossover functionality. The PHY 4060 in this arrangement is also configured such that 
it is capable of auto-crossover support. The PHY 4060 is synchronised form a local 25MHz crystal oscillator (as specified 
in ISO/I EC 8802.3:2000(E)). 

[0144] Figure 41 schematically illustrates an arrangement for audio clock reception according to a development of 
the present technique. The reception arrangement comprises an RJ45 connector 4110 that provides paths to both a 
clock line processing sequence and a data processing sequence. The clock line processing sequence comprises a 
clock transformer 4120, an LVDS receiver 4130 and a word clock extraction unit 4140. The data line processing se- 
quence comprises a data transformer 4150, a physical layer device (PHY) 4160 and a logic unit 4170 for processing 
both input and output audio data. The transformer — coupled clock signal is supplied to the LVDS receiver 41 30 where 
it is converted to a logic signal. The output of the LVDS receiver 41 30 is used to drive a local high-quality phase-locked 
loop circuit to generate a local audio reference clock. The output of the LVDS receiver 41 30 is also supplied as input 
to the word clock extraction unit 41 40 which derives a word clock signal from the received multiplexed clock signal and 
outputs a word clock signal to the local system. 

[0145] In the arrangement of Figures 40 and 41 the Medium Dependent Interface (MDI) pin allocation is such that 
the audio data is conveyed on the same pins as conventionally used for 1 0OBase-TX Ethernet (i.e. fast Ethernet). This 
audio data transmission/reception is implemented using PHY devices 41 40, 4260 that support auto-crossover. Support 
for auto-crossover is often denoted by "auto-MDIX" in PHY datasheets. 

[0146] The LVDS transmitter 4020 and LVDS receiver 4130 operate in half-duplex mode. The LVDS transmitter/ 
receiver is connected to MDI pins via the corresponding clock line transformers 4020 or 4130, each of which has a 1 : 
1 turns ratio. All four transformers i.e. clock line transformers 4020, 4130 and data line transformers 4020, 4130 in this 
arrangement are of the same 1 0/1 00 Base-T transformer type. The use of half-duplex LVDS transceivers 4020, 4130 
allows auto-crossover since the effective pin allocation of the transmit/receive pairs can be reversed id required. 
[0147] Figure 42 schematically illustrates a synchronisation signal physical arrangement according to the develop- 
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ment of the present technique. This figure shows the connections between the LVDS transceivers 4020, 4130 (half- 
duplex) and the RJ45 connector 4040, 4110 pins. One arrangement according to the development of the technique 
incorporates all of the physical layer auto -crossover circuitry (for both audio data and sync signal) integrated into a 
single package. This particular arrangement uses of a 1000Base-TX (Gigabit) physical layer circuitry package featuring 

5 four symmetrical centre-tapped transformers with line-side common-mode chokes. Since such 1000Base-TX PHY 
devices are generally also 1 0/100Base-T compliant, the device also works well with the sync signal. In a modification 
of the arrangement having integrated physical layer auto-crossover circuitry, the circuitry can be integrated within the 
RJ45 socket thereby providing a significantly reduced printed circuit board (PCB) layout area. However, this precludes 
use of an XL-shell ruggedised RJ45 connector (which is recommended), if the RJ45 on the PCB is an external connector. 

10 [0148] Figure 1 8A shows an audio data format for the 32 DSD channel embodiment which comprises 384 four-byte 
data words (1536 byte frame) and Figure 1 8B shows an alternative frame format for the 24 DSD channel embodiment 
comprising 368 four-byte data words (1472 byte frame) with a 1408-byte frame payload having 352 DSD audio data 
samples for 24 channels plus 88 bytes of auxiliary data. 

[0149] Figure 43 shows an audio data frame format according to the further development of the present technique. 

15 Each frame consists of 1448 bytes (or "octets"). The frame payload comprises a total of 26 logical channels, 24 of 
which are allocated for audio data transfer (as for the frame format of figure 1 8B) of either PCM or DSD audio data 
and the remaining 2 logical channels are allocated to auxiliary data transfer, bytes (or "octets"). 
[0150] The structure of the audio frame of Figure 43 generally conforms to the standard ISO/IEC 8802.3-2000(E) 
section 3.1.1 "MAC frame format". However, in the present arrangement the frame length is 1448 octets and the 

20 LENGTH/TYPE field (octets 20-21 in Figure 43) indicate the appropriate length value of 1 422 octets. In the audio frame 
format of Figure 43 octets 0-7 contain a preamble and start frame delimiter; octets 8-13 contain a MAC destination 
address; octets 14-19 contain a MAC source address; octets 20-21 contain a length/type field; octets 22-24 contain a 
Logical Link Control (LLC) header; octets 25-29 contain a SubNetwork Access Protocol (SNAP) header; octets 30-35 
contain a frame format identification header; octets 36-1443 contain the data payload; and octets 1444-1447 contain 
- 25- a 32-bit cyclic redundancy check. 

[0151] The SNAP header in octets 25-29 is an extension of the LLC header of octets 22-24. The SNAP header is a 
universal protocol identification system that is used in all IEEE802 networks, including Internet Protocol carried over 
Ethernet. The 5-octet identifier comprises the 3-octet Organisationally Unique Identifier (OUI) of the organisation de- 
fining the protocol (the AES ( in this case), plus a 2-octet protocol identifier defined by that organisation. The SNAP is 

30 preceded by the 3-octet Logical Link Control header, taking the value AA 16 -AA 16 -03 16 . The value of the LLC/SNAP 
header uniquely identifies the protocol according to the present technique. 

[0152] Figure 44A schematically illustrates the structure of the frame format identification header corresponding to 
octets 30-39 of the audio frame format of Figure 43. Bits 0:3 of octet 30 specify the protocol minor version of the protocol 
according to the present technique whereas bits 4:7 specify the protocol major version. Bits 0:3 of octet 31 specify the 

35 frame type (bitstream mode audio (e.g. DSD) or AES3-compatible mode audio (PCM)). The frame type definitions are 
specified in Figure 44B. Bits 4:7 of octet 31 contain flag bits used for synchronisation or other purposes. Value definitions 
for the frame type field are listed in Figure 44C for bitstream mode audio (DSD) and in Figure 44D for AES3 compatible 
(PCM) audio mode. Octet 32 specifies the audio format i.e. such details as the sample rate. Octets 33-34 are reserved 
for future use. Value definitions for the audio format field are specified in Figures 44E and F. Octet 35 contains an 8-bit 

40 cyclic redundancy check that is computed bitwise (bit 0 first) over octets 30 to 34 inclusive of the frame format identifier. 
The CRC encoding is defined by the following generator polynomial: G(x) = x 8 +x 2 +x +1 . In the event that the frame 
format CRC indicates that an error has occurred between transmission and reception of the data then the receiver will 
process that frame according to the frame format identifier of the previous frame. 

[0153] The data payload contained in octets 36-1 443 of Figure 43 comprise digital audio data (either PCM or DSD), 
45 auxiliary data and error correction coding data. The data is effectively transported via 26 logical channels, 24 of which 
are allocated to audio data transport and 2 of which are allocated to transport of auxiliary data. 
[01 54] Figure 45 schematically illustrates the frame payload formatting process according to the present technique. 
The process comprises five distinct stages: a PCM sample subframe formatting stage 4510; an audio channel multi- 
plexing stage 4520; an LC segment packing stage 4530; a forward error correction coding stage 4540; and an inter- 
so leaving stage 4550. If the digital audio input is PCM data then all five stages of processing are performed in order to 
produce the frame payload. However, if the digital audio input is DSD data the PCM sample subframe formatting stage 
4510 and the LC segment packing stage 4530 will be bypassed. 

[0155] The PCM sample sub-frame formatting stage 4510 adapt the format of each input PCM audio sample for 
transmission into the 27-bit sample sub-frame structure illustrated in Figure 26B. Bit 0 of the sample sub-frame is 
55 transmitted first and bits 0 through 23 comprise the audio sample data of a specific PCM audio sample. If the digital 
audio stream for transmission is compliant to AES3, then bit 24 shall comprise the User (U) bit, and bit 25 shall comprise 
the Channel Status (C) bit. However, if the digital audio stream for transmission is not compliant to AES3, then the U 
and C bits of Figure 26B will contain logic zero. In the event that the original audio sample is less than 24 bits in length, 
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the audio sample is right-aligned with respect such that the most significant bit (MSB) is always carried in bit 23, and 
any bits 0-8 not carrying audio data shall indicate logic zero. In this case, the actual audio sample length will be deter- 
mined from the frame format identification header (octets 30-35 of Figure 43). 

[01 56] According to this particular arrangement the M-bit (bit 26) of the AES3-mode sample sub-frame of Figure 26B 
5 is encoded to represent the following logic indicators: 

• B indicator:- the AES3 'start of block' indicator (equivalent to AES3 Z-preamble of the previous embodiments). The 
B indicator identifies a particular sample within the audio stream. 

• V indicator:- indicating that the audio data valid valid status. The V bit is sub-sampled every 192 PCM samples 
10 and indicated alongside the B bit. 

[0157] Note that this arrangement differs from the arrangement according to Figures 27A to D and Figures 28A to 
E since the S indication is not required in this case. This is because the DSD synchronisation method whereby a frame 
flag bit is used to mark the first sample of every frame is also applied to the PCM mode in this arrangement thereby 
is obviating the need for the S indicator. 

[0158] If neither the B indicator nor the V indicator is currently being asserted then the M-bit (of Figure 26B) indicates 
logic zero. In this embodiment the B and V indicators are asserted by sequences of four consecutive bits known as 
sync codes. The first bit of a sync code is always logic T, and the encoding of the remaining bits is as indicated in 
Table 12 below. 

20 

Table 12 



25 



Sync code 


Indicators asserted 


1000 


B and V (AES3 validity bit = 0) 


1100 


B and not V (AES3 validity bit = 1) 


other 


(reserved - disregarded at receiver) 



[01 59] The sync code timing is specified such that the bit immediately following the final (fourth) bit of the sync code 
30 marks the indicated sample. Figures 46 A and B give examples of M-bit encoding according to this arrangement. In 
these two figures, M-bit values of a series of consecutive samples are shown (temporally earliest sample is the left- 
most sample). The four-bit sync code according to Table 12 is indicated in bold and the letter 'B' above the string of 
M-bit values indicates the sample marked as being the AES3 block start. Figure 46A shows the sync sequence for B 
and not V (AES3 validity bit = 1 ) whereas Figure 46B shows the sync sequence for B and V (AES3 validity bit = 0). On 
35 reception of an "audio invalid" flag (validity = 0) in an incoming AES3 stream, the transmitter will replace all following 
sample data bits with zero, until the next B-bit indicator (which will carry the "audio invalid" V bit indication to the link 
receiver). 

[0160] Returning now to Figure 45, the audio channel multiplexing stage 4520 maps the multi-channel digital audio 
streams into 24 of the 26 logical channels. This process shall determines both the assignment of audio channels to 
40 logical channels and the order of transmission of audio samples on logical channels. In this arrangement there are six 
different mappings corresponding to six different audio mode/sample rate combinations: 

(i) Bit-stream audio (DSD) mode, bit-rate = 64fs:- the link will carry 24 audio channels. Each logical channel will 
carry one bit-stream audio channel. The bit-rate of the logical channel be 64fs. 
45 (it) Bit-stream audio (DSD) mode, bit-rate = 128fs:- the link will carry 1 2 audio channels, if the audio mode is 1 28fs 

bit-stream. Each bit-stream audio channel will be multiplexed over two of the logical channels. 
(Hi) PCM audio mode, sample-rate = 1fs:- the link will carry 48 audio channels, if the audio mode is 1fs PCM. Each 
LC will contain two multiplexed audio channels. 

(iv) PCM audio mode, sampie-rate = 2fs:- the link will carry 24 audio channels, if the audio mode is 2fs PCM. Each 
50 LC will contain one multiplexed audio channeL 

(v) PCM audio mode, sample-rate = 4fs:- the link will carry 12 audio channels, if the audio mode is 4fs PCM. Each 
audio channel will be multiplexed over two logical channels. 

(vi) PCM audio mode, sampie-rate = 8fs:- the link will carry 6 audio channels, if the audio mode is 8fs PCM. Each 
audio channel will be multiplexed over four logical channels. 

55 

[0161] The LC segment packing stage 4530 of Figure 45 is performed for PCM mode but not for DSD mode. This 
stage involves segmentation of the PCM data assigned to each logical channel by the insertion of null bits. Each 
segment (known as an LC sub-segment) is 27 bits in length, and LC sub-segment boundaries are arranged to coincide 
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with PCM sample sub-frame boundaries wherever possible. The insertion of null bits is arranged with respect to the 
LC frame segmentation (6.2 and Figure 4), in order to control the total number of PCM sample sub-frames conveyed 
in the LC frame segment. The number of LC sub-segments conveyed in the LC frame segment is an integer between 
9 and 13 inclusive, and is dependent on the total audio data rate (i.e. audio base sample frequency), in order to reduce 
the number of audio samples in the transmit/receive data buffers. One suggested relationship between audio base 
sample frequency and the number of LC sub-segments per frame is illustrated in Table 13 below. 



Table 13 



Audio base sample frequency range (Hz) 


Number of LC sub-segments per frame 


38580-37900 


9 


37900-42110 


10 


42110-46320 


11 


46320-50530 


12 


50530-54000 


13 



For each possible number of LC sub-segments (or sample subframes) conveyed per frame the arrangement of LC 
sub-segments and padding bits is as specified in Table 9A for the previous PCM arrangement. 
[0162] A number of examples of logical channel sub-segment formation and logical channel segment packing cor- 
responding respectively to stages 4510 and 4530 of Figure 45 will now be considered for four different audio data 
formats. 

(I) 21s PCM mode with 88.2 kHz sample rate 

[0163] In this case there are 24 logical channels available, so there is a simple one-to-one mapping between logical 
channels and audio channels. Each 1 448-byte physical layer data frame takes 1 1 5,84 \is to transmit (at 1 0OMbit/s PHY 
bit-rate), and each frame conveys one logical channel sub-segment (352 consecutive bits) of each logical channel. 
The frame transmission time of 115.84u.s is equal to about 10.2 samples, assuming a sample rate of 88.2 kHz. So 
each logical channel frame segment will convey eleven consecutive sample subframes from each channel, (a sample 
sub-frame is a 27-bit unit, comprising one 24-bit audio sample, plus three bits of associated metadata). These eleven 
sample sub-frames need to be distributed evenly throughout the 352-bit logical channel frame segment. Since each 
sample sub-frame is 27 bits, this may be achieved by following each sample sub-frame with 5 padding bits. This is 11 
x (27+5) = 352 bits, so no extra padding bits are required at the end of the frame (corresponding to row 3 of Table 9A). 

(ii) 2fs PCM mode with 96kHz sample rate 

[0164] Next, consider 24-channel (2fs) AES3-compatible mode, where the sample rate is 96 kHz. In this case, due 
to the higher sampling frequency the 115.84u.s frame transmission duration is equal to about 11.1 samples, so each 
logical channel frame segment will convey 12 consecutive sample sub-frames from each channel. Again, these 12 
sample sub-frames should be distributed evenly throughout the 352-bit LC frame segment. Since each sample sub- 
frame is 27 bits, this may be achieved by following each sample sub-frame with two padding bits. This is 12x(27+2) = 
348 bits, so four extra padding bits are required at the end of the frame to make up the 352-bit LC frame segment 
length (corresponding to row 4 of Table 9A). 

(iii) 4fs, 12 channel PCM mode with sample rate of 192kHz 

[0165] This is similar to case (ii) above but in 12-channel (4fs) mode, where the sample rate is 192 kHz. Each audio 
channel is multiplexed over two logical channels. But the net data rate per logical channel is identical to the previous 
(96 kHz) case (ii). The same structure of packing eleven 27-bit units into each LC frame segment is retained, although 
these units are now referred to as "LC sub-segments". Each audio sample (i.e. sample sub-frame) is multiplexed, bit- 
by-bit, across two logical channels, into the LC sub-segments. Each LC sub-segment therefore contains half the bits 
from one audio sample sub-frame, and half the bits from the following audio sample sub-frame. 
[0166] The data is multiplexed bit-by-bit across the LC sub-segments to avoid the need to buffer one entire audio 
sample sub-frame, since this would be contrary to the objective of reducing the latency. The logical channel packing 
structure is retained from the simpler 2fs case, in order to maintain consistency at the logical channel level to simplify 
implementation. 
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[0167J The 8fs case, where each audio channel is multiplexed across four logical channels, is a simple extrapolation 
of the 4fs case just described. 

(iv) 1(s PCM mode with 48 channels 

[0168] In this mode two audio channels are multiplexed into each logical channel. Again, the same logical channel 
packing structure is retained, but two concurrent audio sample subframes (from different audio channels) are multi- 
plexed, bit-by-bit, into two consecutive LC sub-segments. 

[0169] Figure 47 schematically illustrates the logical channel frame segmentation process. Digital audio input data 
for transmission undergoes the stages of processing illustrated in Figure 45 (PCM data undergoes five processing 
stages whereas DSD data undergoes three processing stages) and an audio frame payload output of 24 logical chan- 
nels at a frequency of 64fs is produced. Each frame payload is formed from 24 concurrent 352-bit segments of logical 
channel data. 

[0170] Figure 48 schematically illustrates a 26-bit logical channel slice comprising 24 bits of audio data and 2 bits of 
auxiliary data. Six parity bits are added to this LC slice to create a 32-bit encoded block as for the previous embodiments 
and as shown in Figure 22 where logical channel bits 24 and 25, corresponding to the auxiliary data bits are labelled 
as AO and A1 and the six parity bits are labelled PO to P5. The parity bits are generated by performing exclusive-NOR 
logic operations on sub-sets of the logical channel slice of Figure 43 as specified by the table of Figure 23A. 
[0171] As indicated at stage 4550 of Figure 45, the 32 bit encoded blocks are interleaved before transmission on 
the physical layer. The interleaving process defines the order in which elements of the encoded blocks shall be trans- 
mitted on the 4-bit MM interface. In this particular arrangement the order is specified according to the relationship: 

l(i J k)=B((32int(i/256)+4(imod(8))+k),int((i/8)mod(32))) 

where: 

0<i<2815; 
0 < k <3; 

l( i, k ) is an array of 2816 units transmitted on the Mil interface, each 4 bits wide. 1(0, k) is the first to be transmitted 
in the frame, whereas 1(2815, k) is the last; and 

B( m, n ) is an array of 352 encoded blocks generated in the transmitter, each 32 bits wide. Encoded block B(0, 
n) contains the first bit of the LC segments in the frame, and encoded block B(351 , n) contains the last. (0 < m < 
351, 0£n<31). 

[0172] The above definition of the interleaving relationship assumes that the interface to the physical layer device is 
an implementation of the Media Independent Interface (Mil). However, alternative interfaces may be used provided 
that the resulting signal transmitted is logically identical. 

[0173] Figure 45 schematically illustrates the sequence of operations performed in the transmitter to prepare the 
audio data for transmission. Generally, at the receiver, the steps of Figure 45 are performed in reverse to recover the 
audio data and auxiliary data. However instead of the inverse of the error correction encoding process described above, 
a six-bit syndrome is calculated by performing exclusive-OR logic operations on sub-sets of the received encoded 
blocks as described in relation to the previous arrangements with reference to Figure 23B and Table 8. 
[0174] The transceiver device according to this arrangement is operable to maintain counts of the following events 
in order to provide an indication to the user of the link error rate.: 

a) Total number of Mil symbols received since the link was established (minimum 48-bit binary counter). 

b) Total corrected errors since the link was established (minimum 16-bit binary counter). A corrected error condition 
occurs when the error correction syndrome indicates one correctable error (see Table 8). 

c) Total uncorrected errors since link was established (minimum 16-bit binary counter). An uncorrected error con- 
dition occurs when error correction syndrome indicates an uncorrectable error (see Table 8). 

[0175] The system provides a indication to the user (for example, a pulse on a visible warning indicator) when an 
error is corrected. On detection of an uncorrected error, the local system incorporating the transceiver will indicate this 
to the user via an appropriate warning device. If the audio mode is PCM, the receiving section of the transceiver may 
use an interpolation-based algorithm to conceal an isolated error. However, if error concealment is not used or if the 
audio mode is DSD rather than PCM or indeed if the portion of the received PCM digital audio stream in error is too 
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long for effective operation of the error concealment algorithm then the portion of the received digital audio stream in 
error will be replaced with audio silence. 

[0176] Recall that in the previously described arrangement of Figure 1 8B, the 88 bytes of auxiliary data were incor- 
porated in a 1 408 byte payload and the 32 bit data block comprised 24 audio data bits, 2 auxiliary data bits and 6 parity 
5 bits. 

[0177] Figure 49 schematically illustrates how the auxiliary data bits (two per LC slice) are multiplexed over two 
logical channels. The two logical channels assigned to carry auxiliary data can be considered to transport a single bit- 
stream, at double the bit-rate of each individual logical channel but multiplexed over the two logical channels. As shown 
in Figure 49, adjacent bits of the auxiliary data stream are transported on different ones of the two assigned logical 
10 channels. Access to the auxiliary data stream is implemented within the transceiver as an Ethernet communication 
service. The system router contains a packet switch for auxiliary data that is independent of the audio routing. In the 
present arrangement the auxiliary data comprises: timecode data, transport control data, remote gain control data, 
router control data and metadata describing associated audio content. Table 1 4 below specifies a 1 024-octet auxiliary 
data frame format according to the present arrangement. 

15 

Table 14 



Auxiliary data frame Octet 


Content 


0-7 


Preamble and start frame delimiter 


8-13 


MAC destination address 


14-19 


MAC source address 


20-1019 


Payload 


1020-1023 


32-bit frame CRC (ISO/IEC 8802.3) 



[0178] In the developed arrangement according to the present technique auxiliary data is transferred in packets (of 
variable size) across the link. The minimum permissible packet size is 72 octets, and the maximum permissible packet 
size is 512 octets. The auxiliary data packets are structured according to the IEE802.3 MAC frame format (see ISO/ 
30 IEC 8802-3: 2000(E) section 3.1.1 for further information) standard, with the exception that the maximum total frame 
length is 1024 bytes. The two logical channels provide a full-duplex synchronous serial interface (having a typical bit 
rate of 128Fs) over which the auxiliary data packets can be carried. 

[0179] The start and end of a packet is delimited by the presence in the bit-stream of the unique binary pattern: 
01111111110 (0; nine Vs; 0). This binary pattern is denoted an "auxiliary data packet delimiter". In-between packets, 

35 the auxiliary data bit-stream shall comprise contiguous auxiliary data packet delimiters. The uniqueness of this pattern 
is assured by inserting an additional zero into the packet data after any eight consecutive 1 's (a "bit-stuffing" process). 
[0180] Figure 50A schematically illustrates a convolutional encoder used to scramble the auxiliary data prior to "bit- 
stuffing" at the transmitter. Figure 50B shows the corresponding descramblerthat is applied to the auxiliary data at the 
receiver. The encoder of Figure 50A comprises nine registers and two exclusive OR (XOR) gates 5010, 5020. A first 

40 XOR gate 5010 has one input from the last of the nine serially connected registers and a second input derived from 
the output of the fifth register. The second XOR gate 5020 has one input corresponding to a non-delayed signal value 
and a second input derived from the output of the first XOR gate 501 0. The states of all nine registers in the encoder 
are pre-set to logic value 1 prior to transmission of each audio data frame. The convolutional decoder of Figure 50B 
is appropriately arranged to reverse the convolutional encoding of the circuit of Figure 50A. The scrambling is performed 

45 in order to randomise the data content, to avoid content-dependent changes in the overhead as a result of the "bit- 
stuffing" process. For example, a data stream comprising many consecutive 'Vs will cause dramatically increased 
overhead compared to random data. Convolutional scrambling of the data prior to insertion of the sync flags and prior 
to bit stuffing alleviates this problem. 

[0181] Figure 51 schematically illustrates an auxiliary data protocol hierarchy. The protocol stack comprises the 
5Q following layers from lowest to highest: PHY interface layer 5110; High Resolution Multi-channel Audio Interconnect 
frame format layer 51 20; payload encoding layer 51 30; audio/auxiliary data ILC formatting layer 51 40/42; and Auxiliary 
data frame format layer 51 50. In the PHY interface layer 51 1 0 the data comprises physical layer (Medium Independent 
Interface) frames prefixed only by a preamble. In the frame format layer 5120, the data comprises 1448-byte frames 
having Ethernet MAC headers, protocol specific identification and a 1408 byte payload. In the payload encoding layer 
55 5140, the data is mapped into 26 logical channels and is forward error correction encoded and interleaved. In the next 
highest layer i.e. the audio/auxiliary data ILC formatting layer 5140/42 the data is separated into either audio data 
logical channel format 5140 in which the audio data is encapsulated in a format in dependence upon whether it is DSD 
audio data or PCM audio data and then multiplexed onto 24 of the 26 logical channels or auxiliary data logical channel 
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format 5142 in which the auxiliary data frames are demarcated in a contiguous 128Fs bitstream and multiplexed into 
the remaining two logical channels. At the highest protocol layer 51 50 the auxiliary data is packaged in software-defined 
Ethernet frames. The higher audio layers of the protocol stack are not shown in Figure 51 . However, Internet Protocol 
(in conjunction with TCP, UDP or SNMP (Simple Network Management Protocol) are used in some arrangements 

5 according to the present technique to permit integration of the auxiliary data system with local IP networks that are 
used for system control. Such arrangements have a router device with a number of ports, the audio connections of 
which are interconnected by a low-latency cross-point switch (functionally similar to a conventional audio patchbay). 
The auxiliary data connections are linked to a commercially-available Ethernet switch IC, which provides full Ethernet 
network functionality for the auxiliary data system, whilst retaining the performance benefits of the multi-channel audio 

10 connection according to the present technique and channel-switched routing for the audio. In yet further arrangements, 
some extra ports on the Ethernet switch IC are brought out to dedicated 100Base-TX Ethernet external connections. 
This allows the auxiliary data network to be fully integrated with a conventional local area network — the only restriction 
being the reduced bandwidth (about 5 Mbit/sec) of the above described auxiliary data links. 

[01 82] The synchronisation process for this developed arrangement is the same as that described above (see Figures 
15 27 to 37) for the previous arrangements with one exception. Previously the PCM samples were individually labelled 
with sync markers via S indicator in the M-bit encoding (see Figures 26 to 28) whereas the DSD mode frames were 
synchronised by aligning the first sample in the first frame with the fs/2048 sync indicator. However in the development 
of these arrangements, the same synchronisation mechanism is applied to both PCM and DSD data. In particular, the 
DSD synchronisation mechanism whereby the first sample in the first frame is aligned with the fs/2048 sync indicator 
20 is also applied to the AES3 compatible (PCM) mode. 

[0183] The synchronisation signal is transmitted by all nodes, and carried independently in both directions along the 
link. The synchronisation signal shall is a 64fs clock signal, with periodic shifts in the timing of the negative edge every 
2048 fs periods as described above with reference to Figure 32. These edge timing shifts are derived from the phase 
of the word clock, and known as phase indicators. The synchronisation signal for transmission is derived from the local 
25 ifs word clock and 64fs clock signals, as illustrated in Figure 32. If a given device has multiple transceivers then all 
HRMAI transceivers will transmit phase indicators simultaneously. 

[0184] Each transceiver incorporates means to detect whether the transmitted and received synchronisation signals 
have identical frequency (that is, whether both devices are synchronised to the same clock source). If the transmitted 
and received clock signals do not have identical frequency then a clock error condition is indicated. In particular, if the 
30 detected interval between received synchronisation signal phase indicators is found to be anything other than exactly 
2048 fs periods (that is, 131072 64fs transitions), a clock error condition shall is indicated whereupon the following 
actions will occur in the transceiver: 

a) The audio outputs from the transceiver to the local device are muted; 
35 b) The transmitter logic is reset following completion of the frame currently being transmitted. This reset condition 

persists until the next local fs/2048 synchronisation signal. The next frame transmitted shall indicate an audio 
stream synchronisation marker; and 

c) The auxiliary data system remains fully operable, apart from a brief pause in operation while the transmitter is 
under reset condition. 

40 

[0185] The resulting pause in frame transmission may cause some audio samples to be lost. After the transmitted 
and received synchronisation signals have been measured to have identical frequency for approximately 1 00 ms, the 
audio outputs shall be un-muted. Disconnecting the link cable will cause the receiver outputs to mute without producing 
any audible artefacts. Connecting the link cable will cause the receiver outputs to un-mute, without producing any 
45 audible artefacts. 

[0186] If a device incorporating a transceiver 51 0, 520 is configured to synchronise its master clock from the received 
link synchronisation signal: it is recommended that the transceiver should inhibit transmission of a synchronisation 
signal, until the local device has detected clock synchronisation. 

[0187] The audio stream synchronisation markers are embedded in the transmitted audio stream in dependence on 
50 the synchronisation signal phase indicators (see Figure 32). An audio stream synchronisation marker is associated 
with an audio sample that is clocked into the transmitter simultaneously with transmission of a phase indicator (that is, 
the synchronisation markers are generated in dependence on a local 2048fs clock). Frame transmission in both DSD 
mode and PCM mode is arranged such that the audio sample associated with an audio stream synchronisation marker 
is the first audio sample conveyed in a frame (neglecting sample reordering at the interleaving stage). 
55 [0188] If an audio sample associated with an audio stream synchronisation marker is received, it shall be output to 
the local device a pre-determined and constant time after reception of the preceding synchronisation signal phase 
indicator. This pre-determined and constant time comprises the total latency of the link, neglecting cable propagation 
delay. The latency is dependent upon the audio mode and audio sampling frequency. 
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[0189] Figure 52 schematically illustrates synchronisation connections between the transceivers 510, 520 (see Fig- 
ure 5) according to the present technique (including transceivers of the developed arrangement that support auto- 
crossover) and the rest of the local system, for example an audio data recorder or A/D converter. The arrangement of 
Figure 52 comprises a transceiver 5210 (counterpart of devices 510 or 520 in Figure 5) having a phase detection 

5 module 5212 and a sync signal generation module 5214. The transceiver is connected via a switch 5220 to the phase 
locked loop of the local system having a positive edge-triggered phase comparator 5230 in series connection with a 
low- pass filter 5240 and a voltage-controlled crystal oscillator (VCXO) unit 5250. The output of the VCXO unit 5250 
is supplied as input to a system master clock divider 5260 which in turn outputs clock signals to the rest of the local 
system. A 64Fs sync signal derived from the cable is sent from the transceiver 5210 to the phase locked loop circuitry 

10 of the local system 5220, 5240, 5250. The phase detection indicator 5212 outputs a received fs/2048 indicator to the 
system master clock divider 5260. The system master clock divider 5260 outputs both a transceiver master clock signal 
and a local f2/2048 indicator to the sync generation module 5214 of the transceiver 5210 for output as a sync signal 
onto the cable. 

[0190] As described above with reference to Figure 32. The synchronisation signal is functionally a multiplex of 64fs 

15 and fs/2048 clock signals. In practice, this is achieved by transferring a 64fs clock in the positive edge timing of the 
clock signal, and modulating the pulse width of this signal to indicate the relative phase of the fs/2048 clock. Since the 
fs/2048 clock is phase-aligned with local word clock, the word clock is effectively conveyed across the link. The syn- 
chronisation signal may is used at the receiving device as a "clock source" (that is, an audio sample clock synchroni- 
sation source), so that a separate clock line between the devices is not required. In the arrangement of Figure 52 the 

20 positive edges of the received sync signal from the transceiver are used to synchronise the master phase locked loop 
local of the local system which is VCXO based. This is the clock source for the transceiver, including the logic to detect 
the phase indicators in the received sync signal. The detected fs/2048 phase indicators are used to preset the local 
system's master clock divider, such that the signals "received fs/2048 indicator" and "local fs/2048 indicator" in Figure 
52 ta_ke identical phase. A signal labelled "local fs/2048 indicator" in Figure.52 is derived from the most significant bit 

25 of the counter in the local system's master clock divider in the present arrangement. 

[0191] In this way, the fs/2048 indicators at each end of the link take on identical frequency and phase (neglecting 
cable propagation delays), and hence it follows that all related audio clock signals take on identical frequency and 
phase. The switch 5220 in Figure 52 allows the local system phase locked loop to synchronise from a different source, 
such as an internal fixed-frequency oscillator, or a "house clock" timing reference. In a link according to the present 

30 technique, only one of the devices at either end of the link may synchronise to the received clock signal (as illustrated 
in Figure 52), otherwise a timing loop would occur as the phase locked loops in the two devices attempt to synchronise 
to each other without a stable timing reference. 

[0192] Figure 53 schematically illustrates a hardware arrangement used to achieve audio latency control using the 
audio stream synchronisation markers (ASSMs) according to the development of the present technique (see Figures 

35 46 A and B). The arrangement comprises a first transceiver device 5310 having an ASSM embedding unit 5312 and 
a PHY processing unit 5314 that performs buffering, encoding and interleaving of the data. A second transceiver device 
5330 comprises a PHY unit 5352 operable to de-interleave and decode audio data received from the first transceiver 
531 0 across the link and further comprises an ASSM detection module 5354, a buffer control unit 5356, a monostable 
timer 5358 and a buffer 5359 operable to buffer audio data prior to outputting it to the audio device associated with the 

40 second transceiver 5350. The monostable timer 5358 supplies an fs/2048 indicator from the device associated with 
the second transceiver 5350 to the buffer control unit 5359. 

[01 93] The audio stream synchronisation markers (ASSMs) effectively provide a means for the clock synchronisation 
system of Figure 52 to "hook onto" the audio streams, thus controlling the audio latency. An ASSM is "embedded" in 
the audio stream by the ASSM embedding unit 5312, at the point where it is first clocked into the transceiver from the 

^5 local system. The ASSM is generated in dependence on the local fs/2048 indicator as shown in the first transceiver 
device 5310. The ASSM remains attached to the specified audio sample throughout the link, until it reaches the output 
buffer 5359 in the receiving section of the remote link. This buffer 5359 is controlled, such that the marked audio sample 
is output a specified time after the fs/2048 indicator occurred. Since the ASSM was originally generated in dependence 
on the fs/2048 indicator, and the fs/2048 indicators at each end of the link are effectively identical, the audio sample 

50 is output a specific time after it entered the other end of the HRMAI link. This is deterministic latency control. 

[0194] Once the synchronisation system of Figure 52 is locked, the audio stream synchronisation mechanism aligns 
the output buffer as described above with reference to Figure 53, to achieve the required latency. This process occurs 
once only at start-up, after which the latency remains constant for as long as the synchronisation system remains 
locked. ASSM are indicated by arranging frame transmission at start-up such that the audio sample associated with 

55 the ASSM is always located at the head of a frame payload. The frame in which this occurs has a specific Frame 
Format ID flag asserted (octet 31 , bit 0 of the frame of Figure 44). 

[0195] Whenever an ASSM-marked audio sample is found at the receiver, it should coincide with the local fs/2048 
indicator - if not, it indicates that a synchronisation error has occurred. However, such a synchronisation fault, causing 
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misalignment of ASSM-marked samples with the fs/2048 indicator, should always result in a clock error condition due 
to loss of the lock condition of the phase locked loop or an incorrect phase indicator interval. Either of these conditions 
will cause a link reset, to correct the problem. 

[0196] The invention may be embodied in software, programmable hardware (e.g. FPGA, ASIC), hardware or a 
combination of these. In the case of a software component, the invention also includes a providing (e.g. storage, 
transmission) medium by which such software is provided. 



Claims 



1 . A data communications system for communicating a data signal formed of successive data elements, the system 
comprising a transmission node; a reception node; and a link providing a data connection from the transmission 
node to the reception node; 

the transmission node comprising: 

means for transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation 
clocking signal having synchronising features occurring at a frequency lower than a data element rate; and 
means for assembling elements of the data signal into data frames, each data frame having a plurality of 
successive data elements of the data signal, for transmission to the reception node via the link, the assembling 
means being responsive to the synchronisation clocking signal so as to set a synchronisation flag associated 
with a data element having a first predetermined temporal relationship with a synchronising feature of the 
synchronisation clocking signal; 
and the reception node comprising: 

means for detecting a synchronising feature of the synchronisation clocking signal received from the trans- 
mission node; 

means for disassembling received data frames to regenerate the data signal, the disassembling means 

being operable to detect a data element associated with a set synchronisation flag; 

means for outputting a data element associated with a set synchronisation flag at a second predetermined 

temporal relationship with respect to the synchronising feature of the received synchronisation clocking 

signal; 

the first and second predetermined temporal relationships being arranged so that a predetermined system 
latency exists between input of a data element to the transmission node and subsequent output of that 
data element by the reception node. 

2. A system according to claim 1 , in which the assembling means is operable: 

i. to set a synchronisation flag associated with a data frame containing a data element having a first predeter- 
mined temporal relationship with a synchronising feature of the synchronisation clocking signal; and 

ii. to position such a data element at a predetermined position within that data frame. 

3. A system according to claim 2, in which the predetermined position is a first-transmitted data element position 
within that data frame. 

4. A system according to any one of the preceding claims, in which: 

the transmission node comprises means for transmitting a data clock to the receiving node via the link, the 
data clock defining the timing of the data elements or components of the data elements; and 
the reception node comprises means for receiving the data clock from the transmitting node and for outputting 
the data elements in accordance with the received data clock. 

5. A system according to claim 4, in which said data clock transmitter is operable to transmit a Multipoint Low-Voltage 
Differential Signalling signal to said receiving node. 

6. A system according to claim 4 or claim 5, in which: 

the transmission node comprises means for combining the synchronisation clocking signal and the data clock 
to form a multiplexed clock signal for transmission to the reception node via the link; and 
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the reception node comprises means for demultiplexing the synchronisation clocking signal and the data clock 
from the multiplexed clock signal. 

7. A system according to claim 6, in which the combining means comprises means for adjusting the timing of a subset 
of clock pulses of the data clock signal in dependence on a synchronising feature of the synchronisation clocking 
signal. 

8. A system according to claim 7, in which: 

the data clock is defined with respect to a periodic reference clock edge; the transmission node is operable 
to adjust the timing of one or more clock edges of the data clock other than the reference edges in response 
to a synchronising feature of the synchronisation clocking signal; and 

the reception node comprises means for detecting timing deviations in clock edges of the data clock other 
than the reference edges. 

9. A system according to any one of the preceding claims, in which the transmission node is responsive to an externally 
supplied synchronisation clocking signal. 

10. A system according to any one of the preceding claims, in which the outputting means comprises a time delay 
arrangement, so that data elements from a data frame associated with a set synchronisation flag are output a 
predetermined delay time after the reception node receives the synchronising feature of the synchronisation clock- 
ing signal. 

11. A system according to claim 10, in which the predetermined delay time is substantially equal to a latency time 
required by the transmission node and the reception node to handle a data element for transmission via the link. 

12. A system according to any one of the preceding claims, 

the transmission node having: 

a frame assembly arrangement operable to receive input data elements at an input data rate and to buffer the 
input data elements prior to performing a frame assembly operation in which buffered data is retrieved and 
assembled to form the framed data, the frame assembly arrangement being operable to output the framed 
data for transmission at a framed data rate; 
and the receiving node having: 

a frame receiving arrangement operable to receive framed data from the transmission node at the framed 
data rate and to buffer the received framed data prior to performing frame disassembly to produce output 
data elements at an output data rate; 

in which output of framed data is commenced by the frame assembly arrangement prior to assembly of 
a complete frame and output of data blocks is commenced by the frame receiving arrangement prior to 
disassembly of a complete frame of received framed data. 

13. A system according to any one of the preceding claims, in which the data elements are samples of a one-bit signal. 

14. A system according to any one of claims 1 to 12, in which the data elements are plural-bit data words. 

15. A system according to any one of the preceding claims, in which the data elements comprise audio samples. 

16. A system according to claim 15 as dependent on claim 14, in which the data elements are derived from AES3 
standard audio sample subframes. 

17. A system according to claim 15 as dependent on claim 13, in which said data elements are derived from one-bit, 
delta-sigma modulated audio samples. 

18. A system according to claim 4 or any one of claims 6 to 17 as dependent on claim 4, in which: 

the data clock defines the timing of individual data bits of each data word; 

the transmission node and the reception node operate in accordance with a word clock, being a sub-multiple 
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of the data clock, to define the timing of individual data words. 

19. A system according to claim 18, in which the synchronising feature of the synchronisation clocking signal has a 
constant temporal relationship to the word clock. 

20. A system according to claim 19, in which the reception node comprises means for deriving the word clock from 
the synchronising features of the synchronisation clocking signal. 

21. A system according to any one of the preceding claims, in which the link is a wired link. 

22. A system according to any one of the preceding claims, in which the link comprises the physical layer of an Ethernet 
link. 

23. A transmission node for use in a data communications system for communicating a data signal formed of succes- 
sive data elements having a reception node; and a link providing a data connection from the transmission node to 
the reception node; 

the transmission node comprising: 

means for transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation 
clocking signal having synchronising features occurring at a frequency lower than a data element rate; and 
means for assembling elements of the data signal into data frames, each data frame having a plurality of 
successive data elements of the data signal, for transmission to the reception node via the link, the assembling 
means being responsive to the synchronisation clocking signal so as to set a synchronisation flag associated 
with a data element having a first predetermined temporal relationship with a synchronising feature of the 
synchronisation clocking signal. 

24. A reception node for use in a data communications system for communicating a data signal formed of successive 
data elements having a transmission node and a link providing a data connection from the transmission node to 
the reception node; 

the reception node comprising: 

means for detecting a synchronising feature of the synchronisation clocking signal received from the trans- 
mission node, the synchronisation clocking signal having synchronising features occurring at a frequency 
lower than a data element rate; 

means for disassembling received data frames to regenerate the data signal, the disassembling means being 
operable to detect a data element associated with a set synchronisation flag; 

means for outputting a data element associated with a set synchronisation flag at a second predetermined 
temporal relationship with respect to the synchronising feature of the received synchronisation clocking signal. 

25. A data communications method for communicating a data signal formed of successive data elements, the system 
comprising a transmission node; a reception node; and a link providing a data connection from the transmission 
node to the reception node; the method comprising the steps of: 

the transmission node: 

transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation clock- 
ing signal having synchronising features occurring at a frequency lower than a data element rate; 
assembling elements of the data signal into data frames, each data frame having a plurality of successive 
data elements of the data signal, for transmission to the reception node via the link, the assembling step 
being responsive to the synchronisation clocking signal so as to set a synchronisation flag associated with 
a data element having a first predetermined temporal relationship with a synchronising feature of the 
synchronisation clocking signal; 

and the reception node: 

detecting a synchronising feature of the synchronisation clocking signal received from the transmission 
node; 

disassembling received data frames to regenerate the data signal, the disassembling step being operable 
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to detect a data element associated with a set synchronisation flag; and outputting a data element asso- 
ciated with a set synchronisation flag at a second predetermined temporal relationship with respect to the 
synchronising feature of the received synchronisation clocking signal; 

s the first and second predetermined temporal relationships being arranged so that a predetermined system 

latency exists between input of a data element to the transmission node and subsequent output of that data element 
by the reception node. 

26. A method of operation of a transmission node for use in a data communications system for communicating a data 
10 signal formed of successive data elements having a reception node; and a link providing a data connection from 

the transmission node to the reception node; 
the method comprising the steps of: 

transmitting a synchronisation clocking signal to the reception node via the link, the synchronisation clocking 
signal having synchronising features occurring at a frequency lower than a data element rate; and 
assembling elements of the data signal into data frames, each data frame having a plurality of successive 
data elements of the data signal, for transmission to the reception node via the link, the assembling step being 
responsive to the synchronisation clocking signal so as to set a synchronisation flag associated with a data 
element having a first predetermined temporal relationship with a synchronising feature of the synchronisation 
clocking signal. 

27. A method of operation of a reception node for use in a data communications system for communicating a data 
signal formed of successive data elements having a transmission node and a link providing a data connection from 
the transmission node to the reception node; 

25 the method comprising-the steps of: 

detecting a synchronising feature of the synchronisation clocking signal received from the transmission node, 
the synchronisation clocking signal having synchronising features occurring at a frequency lower than a data 
element rate; 

30 disassembling received data frames to regenerate the data signal, the disassembling step being operable to 

detect a data element associated with a set synchronisation flag; and 

outputting a data element associated with a set synchronisation flag at a second predetermined temporal 
relationship with respect to the synchronising feature of the received synchronisation clocking signal. 

35 28. Computer software having program code for carrying out a method according to any one of claims 25 to 27. 

29. A providing medium by which software according to claim 28 is provided. 

30. A medium according to claim 29, the medium being a storage medium. 

40 

31. A medium according to claim 29, the medium being a transmission medium. 
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Value 


Frame type 


Oie 


Frame contains bitstream-mode audio 


lie 


Frame contains AES3-compatible mode audio 


(other) 


(reserved - frame shall be disregarded by receiver) 


Fig. 44B 


Bit 


Frame indication 


0 


0: (no flag) 




1 : first sample in frame is associated with an Audio sample 
synchronisation marker (see section 9.3) 


3:1 


(reserved: transmitted as zero, disregarded by receiver) 


Fig. 44C 


Bit 


Frame indication 


0 


0: (no flag) 




1: first sample in frame is associated with an Audio sample 
synchronisation marker (see section 9.3) 


3:1 


(reserved: transmitted as zero, disregarded by receiver) 



Fig. 44D 
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Flag indication 


61 16 


Frame contains bit-stream audio: 1-bit samples, 64f s sample rate 


7116 


Frame contains bit-stream audio: 1-bit samples, 128f s sample rate 


(other) 


(reserved - frame shall be disregarded by receiver) 



Fig. 44E 
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Flag indication ~ 


3:0 


Sample word length: (unsigned 4-bit integer, n) 




sample word length = 4n 


5:4 


Sample frequency multiplier: 




00 2 : 1x base frequency 




01 2 : 2x base frequency 




10 2 :4x base frequency 




11 2 :8x base frequency 


6 


Base sample frequency: 




0: 44,1 kHz 




1:48 kHz 
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Variable sample frequency indicator: 




0: Maximum sample frequency deviation = 100 ppm 




1: Maximum sample frequency deviation = 12,5 % 



Fig. 44F 
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